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METHODS USING GENETI C PACKAGE DISPLAY FOR DETECTING AND IDENTIFYING PROTEIN-PROTEIN 
INTERACTIONS 



TECHNICAL FIELD 

5 This invention relates generally to genetic package display (e.g., phage 

display), and in particular, to selection of ligands that bind to a cell surface receptor 
and internalize useful in gene therapy, screening, and in various methodologies. The 
methods described herein are also referred to as Ligand Identification Via Expression 
or K UVE™ H . 

1 0 BACKGROUND OF THE INVENTION 

Bacteriophage expressing a peptide on its surface has been used to 
identify protein binding domains, including antigenic determinants, antibodies that 
are specifically reactive, mutants with high affinity binding, identify novel ligands, 
and substrate sites for enzymes. In its most common form, a peptide is expressed as a 

1 5 fusion protein with a coat protein of a filamentous phage. This results in the display 
of the foreign protein on the surface of the phage particle. Libraries of phages are 
generated that express a multitude of foreign proteins. These libraries are bound to a 
substrate or cell that presents the binding partner of interest. This screening process 
is essentially an affinity purification. Bound phage are recovered, propagated, and 

20 the gene encoding the foreign protein may be isolated and characterized. This 
technology is commonly referred to as "phage display." 

Through a process called "biopanning," the specific phage carrying a 
peptide or protein that interacts with a protein or other moiety on a solid phase can be 
identified and isolated. However, in many applications, binding or binding affinity is 

25 not the sole critical parameter. For example, in gene therapy, a gene sequence needs 
to be introduced into a cell. In preferred methods, the gene sequence is targeted to 
particular cells by way of a ligand / cell surface receptor interaction. Thus, the ligand 
must not only bind to the cells but must also be internalized. A native ligand that is 
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internalized, when used in a system for gene therapy may not be efficiently 
internalized. For example, both FGF2 and EGF are internalizing ligands. 

Phage libraries can be screened for potentially internalizing ligands by 
biopanning on live cells and rescuing internalized phage from the cells after stripping 
5 off externally bound phage (e.g., acid elution). However, this method may result in 
recovery of undesired phage that bind very tightly or are only partially internalized. 
Moreover, phage that are internalized and subjected to proteases lose infectivity and 
can not be recovered. Accordingly, current methodologies are inadequate to 
determine the usefulness of ligands for gene therapy. 

10 Further, identification of target cells or tissues that are able to 

internalize ligands and express a transgene would readily allow one to identify 
specific target cells for known or putative ligands as well as allow one to identify 
ligands for specific cell or tissue types. However, current methods of target cell 
identification are hampered by the same difficulties, as noted above, with regard to 

IS screening for internalizing ligands. Accordingly, current methodologies are 
inadequate to determine which cell or tissue types are useful targets for ligand 
mediated gene therapy. 

Thus, current screening methods are inadequate for selecting peptide 
or protein ligands that bind to a cell surface receptor and internalize. The present 

20 invention discloses display methods that select peptide or protein ligands that 
internalize, and further provides other related advantages. 

SUMMARY OF THE INVENTION 

Within one aspect of the present invention, a method of selecting 
internalizing ligands displayed on a genetic package is presented, comprising: (a) 
25 contacting a ligand displaying genetic package(s) with a cell(s), wherein the package 
carries a gene encoding a detectable product which is expressed upon internalization 
of the package; and (b) detecting product expressed by the cell(s); thereby selecting 
internalizing ligands displayed on a genetic package. 
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In another aspect, the invention provides a method of identifying an 
internalizing ligand displayed on a genetic package, comprising: (a) contacting one or 
more ligand displaying genetic packages with a cell(s), wherein each package carries 
a gene encoding a selectable marker which is expressed upon internalization of the 
5 package, (b) detecting the selectable marker expressed by the cell(s); and (c) 
recovering a nucleic acid molecule encoding an internalizing ligand from the cell(s) 
expressing the product, and thereby identifying an internalizing ligand displayed on a 
genetic package. 

In yet another aspect, the invention provides a method of identifying 

1 0 an internalizing ligand displayed on a genetic package, comprising: (a) contacting one 
or more ligand displaying genetic packages with a cell(s), wherein each package 
carries a gene encoding a selectable product which is expressed upon internalization 
of the package, (b) incubating the cell(s) under selective conditions; and (c) 
recovering a nucleic acid molecule encoding an internalizing ligand from the cell(s) 

15 which grow under the selective conditions; thereby identifying an internalizing ligand 
displayed on a genetic package. 

In yet another aspect a method is provided for a high throughput 
method of identifying an internalizing ligand displayed on a genetic package, 
comprising: (a) contacting one or more ligand displaying genetic packages with a 

20 cell(s) in an array, wherein each package carries a gene encoding at least one 
detectable product which is expressed upon internalization of the package: and (b) 
detecting produces) expressed by the cell(s) in the array, and thereby identifying an 
internalizing ligand displayed on a genetic package. In one embodiment, the ligand 
displaying package comprises a library of ligand displaying packages. 

25 In another aspect, the present invention provides a method of 

identifying an internalizing ligand displayed on a genetic package, comprising: (a) 
contacting one or more ligand displaying a genetic packages with a cell(s), wherein 
each package carries a selectable marker which is detectable upon internalization of 
the package, (b) detecting the selectable marker internalized by the cells; and (c) 

30 recovering a nucleic acid molecule encoding an internalizing ligand from the cell(s) 
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carrying the selectable marker, thereby identifying an internalizing ligand displayed 
on a genetic package. 

Within one aspect of the present invention, a method of selecting 
inte rn al i zi ng ligand/anti-ligand pairs is presented, comprising: (a) contacting a ligand 
5 displaying genetic package(s) with a cell(s), wherein the package carries a gene 
encoding a detectable product which is expressed upon internalization of the package; 
and (b) detecting product expressed by the cell(s); thereby selecting ligand/anti-ligand 
pairs. 

In another aspect, the invention provides a method of identifying a 

10 ligand or anti-ligand of an internalizing ligand/anti-ligand pair, comprising: (a) 
contacting one or more ligand displaying genetic packages with a cell(s), wherein 
each package carries a gene encoding a detectable product which is expressed upon 
internalization of the package, and wherein the cell(s) expresses an anti-ligand- 
receptor fusion protein on its surface; (b) detecting product expressed by the cell(s); 

15 and (c) recovering a nucleic acid molecule encoding an internalizing ligand and/or a 
nucleic acid molecule encoding an internalizing anti-ligand from the cell(s) 
expressing the product, and thereby identifying a ligand or anti-ligand of a 
internalizing ligand/anti-ligand pair. 

In yet another aspect, the invention provides a method of identifying a 

20 ligand or anti-ligand of an internalizing ligand/anti-ligand pair, comprising: (a) 
contacting one or more ligand displaying genetic packages with a cell(s), wherein 
each package carries a gene encoding a detectable product which is expressed upon 
internalization of the package, and wherein the cell(s) expresses an anti-ligand- 
receptor fusion protein on its surface; (b) incubating the cell(s) under selective 

25 conditions; and (c) recovering a nucleic acid molecule encoding an internalizing 
ligand and/or a nucleic acid molecule encoding an internalizing anti-ligand from the 
cell(s) which grow under the selective conditions; thereby identifying a ligand or anti- 
ligand of a internalizing ligand/anti-ligand pair. 

In yet another aspect, a method is provided for a high throughput 

30 method of identifying a ligand or anti-ligand of an internalizing ligand/anti-ligand 
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interactions, comprising: (a) contacting one or more ligand displaying genetic 
packages with a cell(s) in an array, wherein each package carries a gene encoding at 
least one detectable product which is expressed upon internalization of the package; 
and (b) detecting product(s) expressed by the cell(s) in the arcay, and thereby 
identifying a ligand or anti-ligand of a internalizing ligand/anti-ligand interactions. In 
one embodiment, the array contains cells expressing a library of anti-iigand-reccptor 
fusion proteins. In another embodiment, the ligand displaying package comprises a 
library of ligand displaying packages. 

Within one aspect of the present invention, a method of identifying a 
target cell or tissue for internalizing ligands is presented, comprising: (a) contacting a 
library of ligand displaying genetic packages with a cell(s) or tissue(s), wherein ~»rh 
package carries a gene encoding a detectable product which is expressed upon 
internalization of the package; and (b) detecting product expressed by the cell(s) r 
tissue(s), and thereby identifying a target cell or tissue for internalizing ligands. 

In another aspect, the invention provides a method of selecting an 
inter nalizing ligand for a selected target cell or tissue within a pool of target cells r 
tissues and identifying a target ceil or tissue for the internalizing ligand, comprising: 
(a) co nt ac ting a library of ligand displaying genetic packages with a pool of cell(s) or 
tissue(s), wherein each package carries a gene encoding a selectable marker which is 
expressed upon internalization of the package; (b) detecting the selectable marker 
expressed by the cell(s) or tissue(s); and (c) recovering a nucleic acid molecule 
encoding an internalizing ligand from a selected set of cell(s) or tissue(s) within the 
pool expressing the product 

In yet another aspect, the invention provides a method of selecting an 
internalizing ligand for a selected target cell or tissue within a pool of target cells r 
tissues and identifying a target cell or tissue for the internalizing ligand, comprising: 
(a) contacting a library of ligand displaying genetic packages with a pool of cell(s) or 
tissue(s), wherein each package carries a gene encoding a detectable product which is 
expressed upon internalization of the package; (b) incubating the cell(s) or tissue(s) 
under selective conditi ns; and (c) recovering a nucleic acid molecule encoding an 
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internalizing ligand from a selected set of cell(s) or tissue(s) within the pool which 
grow under the selective conditions; thereby selecting internalizing ligands and 
identifying a target cell or tissue for the internalizing ligand. 

In yet another aspect, a method is provided for a high throughput 
5 method of identifying target cells or tissues for internalizing ligands, comprising: (a) 
contacting a library of ligand displaying genetic packages with cells or tissue in an 
array, wherein each package carries a gene encoding at least one detectable product 
which is expressed upon internalization of the package; and (b) detecting product(s) 
expressed by the cells or tissue in the array; thereby identifying target cells or tissues 
10 for internalizing ligands. In one embodiment, the array contains a variety of cell 
types. In another embodiment, the method further comprises step (c), wherein the 
library is a library of ligand displaying bacteriophages that is repeatedly divided into 
subset pools and screened using steps (a) and (b) until a specific bacteriophage 
expressing an internalizing ligand is identified. 
15 In yet additional embodiments a medicament for gene therapy is 

provided, comprising an internalizing ligand identified by the of the present 
invention. Also provided are anti-bacterial agents comprising an internalizing ligand 
identified by the methods of the present invention. 

Also provided are methods for identifying transduction facilitating 
20 peptides, comprising: (a) contacting one or more ligand displaying a genetic packages 
with a cell(s), wherein each package displays a putative transduction facilitating 
peptide and a ligand known to internalize, and wherein each package carries a 
selectable marker which is detectable upon internalization of the package, (b) 
detecting the selectable marker internalized by the cells; and (c) recovering a nucleic 
25 acid molecule encoding an internalizing ligand from the cell(s) carrying the selectable 
marker, and thereby identifying an internalizing ligand displayed on a genetic 
package. 

In related embodiments, the selectable marker is selected from reporter 
gene expression, expression of a gene that confers the ability to permit cell growth 

» 

30 under selection conditions, non-endogenous nucleic acid sequences that permit PCR 
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amplification, and nucleic acid sequences that can be purified by protein/DNA 
binding. 

In preferred embodiments, the ligand displaying genetic package 
comprises a bacteriophage. The bacteriophage are filamentous phage or lambdoid 
5 phage in other preferred embodiments. In some embodiments, the bacteriophage 
carries a genome vector. In other embodiments, the bacteriophage carries a hybrid 
vector. 

In other embodiments, the library is a cDNA library, an antibody gene 
library, a random peptide gene library, or a mutein library. In other preferred 

10 embodiments, the de t e c table product is selected from the group consisting of green 
fluorescent protein, P-galactosidase, secreted alkaline phosphatase, chloramphenicol 
acetyttransferase, luciferase, human growth hormone and neomycin 
phosphotransferase. 

In other embodiments, the cells may be isolated by flow cytometry, for 

1 5 example. In further embodiments, the methods further comprise recovering a nucleic 
acid molecule encoding the ligand from the cell(s) expressing the product Also 
provided are methods for enhancing transduction by utilizing genotoxic agents, heat 
shock, and transduction facilitating peptides. 

These and other aspects of the present invention will become evident 

20 upon reference to the following detailed description and attached drawings. In 
addition, various references are set forth below which describe in more detail certain 
procedures or compositions (e.g., plasmids, etc.), and are therefore incorporated 
herein by reference in their entirety. 

BRIEF DESCRIPTION OF THE DRAWINGS 
25 Figures 1 A and IB are schematic representations of phage vectors for 

mammalian cell transduction. Figure 1A depicts the parent phage vector with wild 
type pin coat protein. The base vector is M13 genome with ampicillin resistance 
(Amp R ) gene and GFP expression cassette inserted into the intergenic region between 
pIV and pD (MEGFP3). The MEGFP3 vect r contains the foil wing elements: ori- 
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CMV, SV40 replication origin and CMV promoter; EGFP, enhanced green 
fluorescent protein gene; BGH, and a bovine growth hormone polyadenylation 
sequence. Figure IB represents the FGF-pIU fusion display phage (MF2/1G3). 

Figure 2 is a scanned image of a Western Blot analysis representing 
5 detection of FGF2-pW fusion protein in protein extracts from purified FGF2-phage 
(FGF2-MEGFP). 

Figures 3A and 3B are bar graphs of EL1SA detection of FGF2 on 
FGF2-phage. Figure 3A depicts the amount of phage protein detected using both the 
empty MEGFP3 (Le. t MG3) vector and the FGF2 fusion construct (FGF2-MEGFP). 
10 Figure 3B depicts the amount of FGF2 detected on the phage having the fusion 
construct. 

Figures 4A and 4B are bar graphs representing the transduction of 
COS cells by FGF2-phage. 

Figure 5 is a bar graph representing the transduction of COS cells by 
1 5 peptide display phage. 

Figure 6 is a scanned image of a Western Blot analysis representing 
detection of EGF-pID fusion protein in protein extracts from purified EGF-phage. 

Figure 7 is a bar graph representing the dose response of COS cells to 
various phage titers. 

20 Figure 8 is a bar graph representing a time course analysis of various 

incubation times and the effect on transduction. 

Figures 9A and 9B are bar graphs representing the specificity of 
transduction of COS cells by EGF-phage. 

Figure 10 is a bar graph representing transduction specificity of a 
25 variety of human carcinoma cells. 

Figure 11 is a scanned image of ethidium bromide stained gel 
electrophoretic analysis of products obtained by PCR amplification of pill genes/pill 
gene fusions following various rounds of selection. 

Figure 12 is a vector map of an AAV-Phage hybrid genome vector. 
30 Figure 1 3 is a vector map of an AAV-Phage hybrid phagemid vector. 
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Figure 14 is a bar graph depicting the effects on transduction following 
genotoxic treatment and/or heat shock treatment of target cells. 

Figure 15 is a bar graph representing the effects on transduction 
following display of an endosomal escape peptide. 
5 Figure 16 is a bar graph displaying the combined effect of heat shock 

and display of an endosomal escape peptide on transduction. 

Figure 17 is a vector map of a pUC-MG4 phagemid vector. 

Figure 1 8 is a vector map of a pUC-MG4-38 dual display vector. 

DETAILED DESCRIPTION OF THE INVENTION 
10 As noted above, the present invention provides methods of using 

ligand displaying genetic packages to identify protein-protein interactions, ligands 
that bind and internalize or that to identify target cells and/or tissues for known or 
putative ligands and to identify transduction facilitating peptides. While it should be 
understood that a variety of ligand display methods may be utilized (e.g., phage 
15 display, RNA-peptide fusions, and ligand displaying bacteria), the present invention 
uses bacteriophage ligand display to exemplify the various embodiments. 

Briefly, in one embodiment of the present invention, a library f 
antibodies, cDNAs, or genes encoding random peptides is cloned into a coat protein 
of a ligand displaying genetic package (e.g., gene III protein of filamentous phage) is 
20 utilized. The phage genome also contains an "expression cassette" encoding a 
transgene placed downstream from a cell promoter that is active in the cells to be 
infected (Figure 1A). The transgene is generally a selectable gene product and/or a 
detectable marker. Phage are contacted with test cells and expression of the transgene 
is monitored or selected. Phage that internalize will confer the phenotype of the 
25 transgene, such as drug resistance or expression of a fluorescing protein. The cells 
may be isolated on the basis of transgene expression. For example, when the 
transgene is a drug resistance gene, cells are grown in the presence of the drug, such 
that only those cells receiving and expressing the transgene are propagated. The 
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gene(s) that are fused with the coat protein and that promoted cell binding and 
internalization are recovered from the selected cells by a suitable method. 

I. DISPLAY PACKAGES 

5 A variety of ligand displaying genetic packages may be used within 

the context of the present invention. A "ligand displaying genetic package" as used 
herein* refers to any package which comprises a peptide/protein ligand and carries a 
nucleic acid molecule capable of detection, once internalized in the target cell. In one 
embodiment, the nucleic acid carried by the ligand displaying genetic package is 

10 expressed upon internalization into the cell thereby allowing for detection of an 
internalized genetic package. In other embodiments, the ligand displaying genetic 
package may cany a nucleic acid molecule which allows for detection via PCR of 
unique sequences or by the ability of the internalized nucleic acid sequences to bind 
non-endogenous DNA binding proteins {e.g., nucleic acid sequence could comprise a 

15 lac operon, thereby allowing for lac repressor binding). Accordingly, display may be 
by a virus, RNA-peptide fusions, bacteriophage, bacteria, or similar system (See, 
Phage Display of Peptides and Proteins, pages 151-193, Kay (Ed.), Academic Press, 
San Diego, 1996). P r e f e r r e d methods utilize bacteriophage. Such phage include the 
filamentous phages, lambda, T4, MS2, and the like. A preferred phage is a 

20 filamentous phage, such as Ml 3 or fl. Accordingly, many illustrations, while 
exemplifying the use of phage, could also be performed with any ligand displaying 
genetic package. 

Phage that present the foreign protein or peptide as a fusion with a 
phage coat protein are engineered to contain the appropriate coding regions. A 

25 variety of bacteriophage and coat proteins may be used. Examples include, without 
limitation, Ml 3 gene III, gene VIII; fd minor coat protein pin (Saggio et a/., Gene 
152:35, 1995); lambda D protein (Sternberg and Hoess, Proc. Natl. Acad Set. USA 
92:1609, 1995; Mikawa et aU J- Mol Biol 26221, 1996); lambda phage tail protein 
pV (Maruyama et ai 9 Proc. Natl Acad Sci. USA 97:8273, 1994; U.S. Patent No. 

30 5627024); ft coat protein (WO 96/1 1 947; DD 292928; DD 2868 1 7; DD 300652); $29 
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tail protein gp9 (Lee, Virol. 6P:5018, 1995); MS2 c at protein; T4 small outer capsid 
protein (Ren et al., Protein Sci. 5:1833, 1996), T4 nonessential capsid scaffold 
protein IPUI (Hong and Black, Virology 1 94:4*1, 1993), or T4 lengthened fibritin 
protein gene (Efimov, Virus Genes 20:173, 1995): PRD-1 gene III; QP3 capsid 
5 protein (as long as dimerization is not interfered with); and P22 tailspike protein 
(Carbonell and Villaverde, Gene 176:225, 1996). Techniques for inserting foreign 
coding sequence into a phage gene are well known (see e.g., Sambrook et al. 
Molecular Cloning: A Laboratory Approach, Cold Spring Harbor Press, NY, 1989; 
Ausubel et al., Current Protocols in Molecular Biology, Greene Publishing Co., NY, 
10 1995). 

In the preferred filamentous phage system, a wide range of vectors are 
available (see, Kay et al., Phage Display of Peptides and Proteins: A Laboratory 
Manual, Academic Press, San Diego, 1996). The most common vectors accept 
inserts in gene III or gene VIII. Furthermore, the foreign gene can be inserted directly 
15 into the phage genome or into a phagemid vector. Methods of propagation of 
fi l a ment ous phage and phagemids are well known. 

Filamentous phage vectors generally fall into two categories: phage 
genome and phagemids. Either type of vector may be used within the context of the 
present invention. Many such commercial vectors are available. For example, the 

20 pEGFP vector series (Clontech; Palo Alto, CA), M13mp vectors (Pharmacia Biotech, 
Sweden), pCANTAB 5E (Pharmacia Biotech), pBluescript series (Stratagene Cloning 
Systems, La Jolla, CA), pComb3 and M13KE (New England Biolabs), and others 
may be used. One particularly useful commercial phagemid vector is pEGFP-Nl, 
which contains a green fluorescent protein (GFP) gene under control of the CMV 

25 immediate-early promoter. This plasmid also includes an S V40 origin of replication 
to enhance gene expression by allowing replication of the phagemid to high copy 
number in cells that make SV40 T antigen. 

Other vectors are available in the scientific community (see e.g., 
Smith, in Vectors: A Survey of Molecular Cloning Vectors and their Uses, Rodriquez 

30 and Denhardt, eds., Butterworth, Boston, pp 61-84, 1988) or may be c nstructed 
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using standard methods (Sambrook et aL, Molecular Biology: A Laboratory 
Approach, Cold Spring Harbor, NY, 1989; Ausubel et al. y Current Protocols in 
Molecular Biology* Greene Publishing, NY, 1 995) guided by the principles discussed 
below. 

5 The source of the ligand {e.g., gene, gene fragment, peptide encoding 

sequence, chemically conjugated peptide or protein, non-covalently conjugated 
peptide or protein) may be for example, derived from a cDNA library, antibody 
library or random peptide library. Alternatively, the ligand may be from a library of 
random or selective mutations of a known ligand. In an additional alternative, the 
10 ligand may be from a library of known receptor or cell surface binding agents. For 
example, the library may contain a subset of peptides known to bind the FGF or EGF 
receptor, but that have unknown gene delivery and expression characteristics {i.e., 
transduction capacity). 

When a cDNA library is used, the starting cDNA is synthesized from 

15 mRNA isolated from the source tissue or cell line from which the desired ligand 
originates. cDNA is then amplified using primers containing sequences f 
appropriate restriction enzyme sites for insertion into the desired vector. 
Alternatively, commercially available cDNA libraries {e.g., Clontech; Palo Alto, CA) 
may be amplified for insertion into the vector. 

20 Similarly, libraries of antibody fragments can be made from mRNA 

isolated from the spleen cells of immunized animals (immunized for example with 
whole target cells or membranes) or subcloned from existing antibody libraries from 
immunized or naive animals. Random peptide encoding sequences are subcloned 
from libraries that are commercially available (New England Biolabs; MA) or can be 

25 synthesized and cloned using previously described methods (see, Kay et al. , supra). 

Phage display libraries of random or selective mutations of known 
ligands (referred to herein as a "mutein library") for improved gene delivery are 
performed in the same manner as described for screening random peptide libraries. 
Random mutations of the native ligand gene may be generated using DN A shuffling 

30 as described by Stemmer {Nature 570:389-391, 1994). Briefly, in this method, the 
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ligand is amplified and randomly digested with DNase I. The 50-300 base pair 
fragments are reassembled in an amplification performed without primers and using 
Tag DNA polymerase or similar enzyme. The high error rate of this polymerase 
introduces random mutations in the fragments that are reassembled at random thus 
introducing combinatorial variations of different mutations distributed over the length 
of the gene. Error prone amplification may alternatively be used to introduce random 
mutations (Bartell and Szostak, Science, 267:1411, 1993). The ligand may be 
mutated by cassette mutagenesis (Hutchison et aL in Methods in Enzymology 
202:356-390, 1991), in which random mutations are introduced using synthetic 
oligonucleotides and cloned into the ligand to create a library of ligands with altered 
binding specificities. Additional mutation methods can be used. Some additional 
methods are described in Kay et al. 9 supra. Further, selective mutations at 
predetennined sites may be performed using standard molecular biological techniques 
(Sambrook et al. 9 Molecular Cloning : A Laboratory Manual, Cold Spring Harbor 
15 Press, 1989). 

If a cDNA library cannot be generated because, for example, the 
source of the desired ligand is not available or is unknown, random peptide libraries 
or a cDNA library from placenta may be used as a starting point for screening. 
Methods for construction of random peptide libraries may be found, for example, in 
20 Kay et a/., supra. Briefly, the random peptides are encoded by DNA assembled from 
degenerate oligonucleotides and inserted into one of the bacteriophage vectors 
described herein. Several different strategies may be used to generate random 
peptides or peptide encoding sequences. For example, triplets ofNNN, wherein each 
N is an equimolar representation of all four nucleotides, will generate all 20 amino 
25 acids (as well as 3 stop codons). Alternative strategies use NN(G/T) and NN(G/C), 
which results in 32 codons that encodes all 20 amino acids and only 1 stop codon. 
Other strategies utilize synthesis of mixtures of trinucleotide codons representing all 
20 amino acids and no stop codons. Once the oligonucleotides are synthesized, they 
are assembled as double strands by a variety of schemes, one of which involves 
30 synthesis of the complementary strand (see Kay et al. 9 supra). 
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in addition to the ligand/coat protein fusion, in one embodiment, the 
vector may contain a gene whose product can be detected or selected for. As referred 
to herein, a "reporter" gene is one whose product can be detected, such as by 
fluorescence, enzyme activity on a chromogenic or fluorescent substrate, and the like 
5 or selected for by growth conditions. Such reporter genes include, without limitation, 
green fluorescent protein (GFP), P~galactosidase, chloramphenicol acetyltransferase 
(CAT), luciferase, neomycin phosphotransferase, secreted alkaline phosphatase 
(SEAP), and human growth hormone (HGH). Selectable markers include drug 
resistances, such as neomycin (G41 8), hygromycin, and the like. 

10 The marker gene is in operative linkage with a promoter. Any 

promoter that is active in the cells to be transfected can be used. The vector should 
also have a viral origin of replication and a packaging signal for assembling the 
vector DNA with the capsid proteins. 

Most applications of the present invention will involve transfection f 

15 m a mm a li an cells, including human, canine, feline, equine, and the like. However* 
several embodiments of the present invention utilize expression in non-mammalian 
cells (e.g., fungal, yeast, bacteria, and plant). The choice of the promoter will depend 
in part upon the targeted cell type. Promoters that are suitable within the context of 
the present invention include, without limitation, constitutive, inducible, tissue 

20 specific, cell type specific, temporal specific, or event-specific, although constitutive 
promoters are preferred. 

Examples of constitutive or nonspecific promoters include the SV40 
early promoter (U.S. Patent No. 5,118,627), the SV40 late promoter (U.S. Patent 
No. 5,1 18,627), CMV early gene promoter (U.S. Patent No. 5,168.062), bovine 

25 papilloma virus promoter, and adenovirus promoter. In addition to viral promoters, 
cellular promoters are also amenable within the context of this invention. In 
particular, cellular promoters for the so-called housekeeping genes are useful (e.g., 0- 
actin). Viral promoters are generally stronger promoters than cellular promoters. 

Additional promoters known and available to those of skill in the art 

30 may also be useful within the context of the present invention. For example, if 



BNSDOC1D <WO 0029555A1 i > 



WO 00/29555 



PCT/US99/2S36! 



15 

display packages are to be used to transduce yeast, a yeast promoter will be required 
and are available in the context of a number of commercially available vectors from a 
variety of sources including Clontech, (Palo Alto, CA), and Invitrogen (Carlsbad, 
CA). Further, if transduction of bacteria is of interest a number of bacterial vect rs 
5 are available of which most are derived from the pUC lineage. In addition, a variety 
of plant vectors and promoters are available, for example general descriptions of plant 
expression vectors and reporter genes can be found in Gruber et al, "Vectors for 
Plant Transformation, in Methods in Plant Molecular Biology & Biotechnology" in 
Glich et al 9 Eds. pp. 89-1 19, CRC Press, 1993. Promoters useful within the context 
10 of the present invention include both constitutive and inducible natural promoters as 
well as engineered promoters. Such promoters may be obtained from plants, viruses, 
or other sources, and include, but are not limited to: those described herein such as 
the 35S promoter of cauliflower mosaic virus (CaMV). Typically, for plant 
expression vectors, suitable promoters include the 35S RNA and 19S RNA promoters 
15 of CaMV (Brisson et aL. Nature 370:51 1, 1984; Odell et aL, Nature J7J:810, 1985); 
the full-length transcript promoter from Figwort Mosaic Virs FMV) (Gowda et aL, J. 
Cell Biochenu 13D: 301, 1989 and U.S. Patent No. 5,378,619) and the coat protein 
promoter to TMV (Takamatsu et aL, EMBO J 6:307, 1987). Alternatively, plant 
promoters such as the light-inducible promoter from the small subunit of ribulose bis- 
20 phosphate carboxylase (ssRUBISCO) (Coruzzi et aL , EMBO J 3: 1 67 1 , 1 984; Broglie 
et al„ Science 224:838, 1984); mannopine synthase promoter (Velten et aL, EMBO J 
1984) nopaline synthase (NOS) and octopine synthase (OCS) promoters 
(carried on tumor-inducing plasmids of Agrobacterium tumefaciens) or heat shock 
promoters, e.g., soybean hspl7.5-E or hsp!7.3-B (Gurley et aL. MoL Cell. BioL 6:559, 
25 1986; Severin et aL, Plant MoL BioL 75:827, 1990) may be used. See PCT 
Publication WO 91/19806 for a review of a variety of known plant promoters which 
are suitable for use within the context of the present invention. 

In preferred embodiments, the phage has an origin of replication 
suitable for the transfected cells. Viral replication systems, such as EBV ori and 
30 EBNA gene, SV40 ori and T antigen, r BPV ori, may be used. Other mammalian 
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replication systems may be interchanged. As well, the replication genes .may cause 
high copy number. Expression of therapeutic genes from the phage genome may be 
enhanced by increasing the copy number of the phage genome. In one method, the 
SV40 origin of replication is used in the presence of S V40 T antigen to cause several 
5 hundred thousand copy number. The T antigen gene may be already present in the 
cells, introduced separately, or included in the phage genome under the 
transcriptional control of a suitable cell promoter. Other viral replication systems for 
increasing copy number can also be used, such as EBV origin and EBNA. 

As noted above, phagemid vectors may also be utilized in the practice 
10 of the present invention. Phagemid vectors are plasmid vectors that contain 
filamentous phage sequences and therefore can be packaged into phage particles 
when the complementing phage structural proteins are provided in trans by helper 
phage. In conventional phagemid systems both the phagemid and helper phage 
encode pill coat protein. A combination of both wild type and pill-fusion protein is 

IS displayed on the resulting phagemid particles with the phagemid encoding the pin 
fusion and the helper encoding wild type pill. This results in a monovalent display of 
peptides or proteins with often less than one recombinant pill fusion protein 
displayed per phage particle. Many antibody libraries are made in phagemid vectors 
because monovalent display is advantageous for selecting high affinity antibodies 

20 over lower affinity antibodies which in a high valence system might be selected on 
the basis of avidity. While similar concerns may justify the use of monovalent 
display vectors in the practice of the present invention, current data suggests, 
however, that multivalent display is important for mammalian cell internalization. 
(Larocca et a/., FASEB J. 75:727-734, 1999; Larocca et aL Human Gene Therapy 

25 P:2393-2399, 1998; Beceml et aL Biochem. Biophys. Res. Comm. 255:386-393; 
Ivanenkov et al. t Biochim. Biophys. Acta 7^:450-462, 1999). Thus, a phagemid 
vector with multivalent display would likely be more useful for selecting internalizing 
phage in mammalian cells. 

Multivalency in a phagemid system may be provided by rescuing with 

30 helper having a gill deletion (or gVIII deletion, depending on the phagemid). 
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Accordingly, rescuing with a gill deleted helper can alter the valency of phagemid 
vectors. Rakonjac and colleagues have developed host strains which allow 
production of the gill deleted helper phage with very low background of gill 
containing recombinants (as low as 1 in 10 9 ). See, e.g., Rakonjac et aL Gene 298:99- 
5 103, 1997. In this system there is little or no wild type pill provided by the helper 
phage so that each phage displays multiple copies of the pin fusion derived from the 
phagemid. Thus, in one embodiment the present invention utilizes a vector 
containing the EGF-pIII fusion protein from the phage vector, MG4-EGF (the vector 
MG4 is constructed by reversing the orientation of the SV40ori/CMV/GFP 

10 expression cassette in the MEGFP3 (MG3) phage vector. MG4-EGF has the EGF 
encoding gene inserted "in-frame 1 * at the pstl/Ncol sites in MG4) and similarly the 
control vector containing the CMV/S V40ori/GFP cassette and the pill gene from the 
control phage. The pIU genes are under the transcriptional control of the inducible 
lac promoter to minimize synthesis of pIU protein in the absence of a suitable inducer 

15 {Le. IPTG). 

Construction and utilization of multivalent phagemid vectors is within 
the knowledge of those of skill in the art. Briefly, the backbone of the above 
phagemids can be easily constructed from pUCl 19 (purchased from ATTC) which 
contains an Ml 3 phage origin of replication and an ampicillin resistance gene. The 

20 CMV/SV40ori/GFP gene is PCR amplified using primers that incorporate an AfllU 
endonuclease site and is inserted into the unique AfUII site in pUCl 19 to create pUC- 
GFP. The pIU and EGF-pIII fusion genes are subcloned from MG4 and MG4-EGF 
by PCR amplification using primers that encode Hindlll and EcoRl endonuclease 
sites and insertion into pUC-GFP at the unique HindOI and EcoRl sites in the 

25 multicloning site to create pUC-MG4 (Figure 17) and pUC-MG4-EGF. The 
phagemid DNA constructs are used to transform host bacteria (XL-1 Blue; 
Stratagene, San Diego, CA) that contain the lac IQ repressor protein gene to suppress 
expression of the pill or EGF-pIII fusion gene in the absence of IPTG. Phagemid 
panicles are rescued from transformed bacteria with a wild type helper phage (i.e. 

30 R408 or VCSM13) or a gill deleted helper (Le. R408d3 or VCSM13d3, Stratagene, 
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San Diego, CA) and tested on COS 1 cells for phage mediated transduction efficiency 
as measured by % GFP positive cells. The results of these experiments (not shown) 
indicate that the multivalent phage (VCSM13d3 rescued) are about 2 orders of 
magnitude more efficient at transducing COS1 cells than the monovalent display 
5 phagemid particles (wild type rescued). Western blot analysis (not shown) of CsCl 
phagemid particles reveals that about 50% of the pill protein displayed on the 
VCSM13d3 rescued phagemid particles is fused to EGF; the remaining pill protein is 
likely derived from a proteolytic cleavage of the fusion protein. Analysis of the 
monovalent phagemid particles shows that more than 66% of the displayed pill is the 

10 length of wild type pill and of MG4-EGF phage shows that 60-80% of the pill 
displayed is full length. Accordingly, such data demonstrate that multivalent display 
of the targeting ligand facilitates uptake of the ligand targeted phage via cognate 
receptor mediated endocytosis and that it is feasible to construct and use a multivalent 
phagemid system for phage mediated mammalian cell transduction. 

IS In yet further embodiments, ligand fusions to a truncated pill or pVIIl 

gene may be utilized. Several advantages become apparent when expressing 
displayed ligands as fusions to a truncated pIIJ (or pVTII) gene. For example, rescue 
by helper phage is simplified because there is no interference of phagemid derived 
pill with the infection by helper phage. Thus multivalent phagemid yields will likely 

20 be increased by infection with helper phage after induction of the phagemid pill 
fusion gene (not possible with fusions to full length pill because of interference). 
Additionally some ligands may be more efficiently expressed as a fusion to truncated 

pm. 

Phagemid vectors having truncated fusions to pill or pVHl are easily 
25 produced by those of ordinary skill in the art. Briefly, a phagemid vector is created 
with a pin gene having only domain 2 of pill (preferably starting from amino acids 
198-250 and ending at amino acid 406), as described by Dottavio (in Phage Display 
of Peptides and Proteins, Kay et al. (Eds.) San Diego, Academic Press, 1998). When 
this phagemid is rescued with the R408d3 or VCSM13d3 helper phage the resulting 
30 phagemid particles are no longer infective in bacteria since domain 1 of pill is 
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required for infectivity. However these panicles can be used for non-bacterial cell 
transduction and subsequent rescue of ligand encoding sequences as described herein. 
Further, to enhance ligand recognition it may be beneficial to fuse the sequence 
encoding the peptide or protein ligand or the peptide or protein ligand itself to domain 
5 2 of pill via a linker molecule (e.g., gly 4 ser, heterobifunctional linkers, and the like). 

In a further embodiment a viral replication system such as an SV40 
based shuttle vector can be used as the phagemid. Accordingly, cells infected by the 
ligand-expressing phage package the phagemid DNA into SV40 viral particles. 
These viral particles infect neighboring cells, thus spreading the phagemid DNA. 

10 Following growth in culture, a dish of cells contains millions of copies of the original 
phagemid, thereby enriching the population of internalized ligand encoding genetic 
packages several fold. The cell lines used with such a system can either be 
transfected with DNA sequences encoding SV40 small and large T-antigens or can 
contain the proteins through delivery with fusion constructs, such as VP22. VP22 is a 

15 herpes virus structural protein that is exported from cells and spreads to neighboring 
cells where it concentrates in the nucleus (Elliot and OUare, Cell 55:223-233, 1997). 
VP22-SV40 T antigen fusion protein encoding vectors exist in the art and are 
available from Invitrogen Corp., as the pVP22myc-His vector. The vector useful in 
this application ideally contains an fl, Ml 3 or comparable phage origin, a coat 

20 protein fusion cassette (pill or pVIII), the SV40 late region genes, and an SV40 
origin.. 

Similarly, an adeno-associated virus/phage hybrid vector may be used 
to achieve the same amplified ligand production. An AAV -phage hybrid vector 
combines selected elements of both vector systems, providing a vector that is simple 

25 to produce in bacteria that is not constrained by a capsid packaging limit, while 
allowing infection of quiescent cells combined with integration into the host 
chromosome. Vectors containing many of the appropriate elements are readily 
available, and can be further modified by standard methodologies to include the 
necessary sequences. For example, the phagemid pAAV/Svneo, ATCC Accession 

30 No. 68065, contains AAV ITR sequences and an Fl origin of replication. In addition 
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the vector pAV.CMV.LacZ provides many of the appropriate elements Fisher et al. 
J. Virol 70<7;:52O-532, 1996. 

Adenoassociated virus (AAV) is a defective member of the 
parvovirus family. The AAV genome is encapsulated as a single-stranded DNA 
5 molecule of plus or minus polarity (Berns and Rose, J. Virol 5:693-699, 1970; 
Blacklow et al., J. Exp. Med 215:755-163, 1967). Strands of both polarities are 
packaged, but in separate virus particles (Berns and Adler, Virology 9:394-396, 1972) 
and both strands are infectious (Samulski et al y J. Virol 67:3096-3101, 1987). The 
single-stranded DNA genome of the human adenoassociated virus type 2 (AAV2) is 

10 4681 base pairs in length and is flanked by inverted terminal repeated sequences of 
145 base pairs each (Lusby et al. t J, Virol 47:518-526, 1982; Muzyczka, Curr. Top. 
Microbiol Immunol 755:97-129, 1992). In addition, the viral rep protein appears to 
mediate non-homologous recombination through the ITRs (Giraud et al.. J. Virol 
5P:6917-6924, 1995; Linden et al, Proc. Natl Acad Sci. USA 95:7966-7972, 1996). 

15 Accordingly, as parvoviral genomes have ITR sequences at each end which play a 
role in recombination and which are generally required for parvoviral replication and 
packaging, the vectors of the present invention generally contain all or a portion of at 
least one of the ITRs or a functional equivalent thereof. 

Adeno-associated viruses may be readily obtained and their use as 

20 vectors for gene delivery has been described in, for example, Muzyczka, Curr. Top. 
Microbiol Immunol 7J«:97-129, 1992; U.S. Patent No. 4,797,368, and PCT 
Application WO 91/1 8088. Construction of AAV vectors is described in a number of 
publications, including U.S. Patent No. 5,173,414; Lebkowski et al. Mol Cell Biol 
5:3988-3996, 1988; Tratschin et al. Mol Cell Biol 5(11)325 1-3260; Hermonat and 

25 Muzyczka, Proc. Natl Acad ScL USA 57:6466-6470, 1984; U.S. Patent Nos. 
5,871,982, 5,773,289, 5,843,742, and 5,474,935; and PCT Application Nos. WO 
98/45462 and WO 98/48005, all of which are incorporated herein by reference. 

AAV-2 can be propagated as a lytic virus or maintained as a pro virus, 
integrated into host cell DNA (Cukor et al, in "The Parvoviruses," Berns ed., Plenum 

30 Publishing Corp., N.Y. pp. 33-66, 1984). Alth ugh under certain conditions AAV 
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can replicate in the absence of helper virus (Yakobson et al. J. Virol d/:972-98l. 
1987), efficient replication requires coinfection with either adenovirus (Atchinson et 
al, Science J 94:754-756, 1965; Hoggan. Fed Proc. Am. Soc. Exp. Biol 2f:248, 
1965; Parks et al., J. Virol 7:171-180, 1967); heipes simplex virus (Buller et al, J. 
5 Virol 40241-247, 1981) or cytomegalovirus, Epstein-Barr virus, or vaccinia virus. 
Hence the classification of AAV as a "defective 91 vims. 

The AAV-phage hybrid vector for ligand identification generally 
comprises an Fl, Ml 3 or comparable origin, a coat protein fusion cassette (e.g., pill- 
ligand or pVHI-ligand), a bacterial gene which facilitates selection, such as ampicillin 

10 resistance, and the two AAV ITRs (or functional equivalents thereof), between which 
is inserted the promoter driven reporter or selectable gene. The hybrid phage can then 
be used to transduce cells. The positive cells are identified and the ligand sequences 
are amplified by PCR using the ITRs or coat protein gene as the template sequence. 
This amplified ligand fusion construct can then be sub-cloned into other vectors for 

15 further rounds of transfection, selection, and ligand sequence identification. 

In related embodiments, the present invention provides an AAV-phage 
vector that is designed to produce functional AAV viral particles upon co-infection 
with a "rescue" virus, such as an adenovirus. In this embodiment, the AAV-phage 
particle enters the cell as a "phage particle", but once inside, produces AAV particles 

20 that infect surrounding cells. This method can thus be used to amplify gene delivery 
effectiveness in a target organ or tissue. Briefly, in this approach, a ligand displaying 
phage particle containing a AAV-phage vector which contains a transgene of interest 
as well as ligand fused to the pill or pVIIl coat protein is used to transduce a cell. 
Concurrent with or subsequent to contacting the cells with the transgene containing 

25 AAV-phage particle, a ligand displaying helper AAV-phage is also used to supply the 
rep and cap genes for viral particle formation. Following transduction with the 
appropriate bacteriophage, the cell is infected with a "rescue" virus, such as an 
adenovirus, which allows viral particles to form and infect neighboring cells. 
Alternatively, the rep and cap functions can be supplied on the helper adenoviral 

30 genome. 



BNSDOC1D: *WO 0029555*1 I > 



WO 00729555 



22 



PCTAJS99/25361 



In another aspect, mutant coat proteins or additional components or 
methods may be used to increase transduction efficiency. A particularly preferred 
manipulation is to mutagenize a coat protein so as to facilitate uncoating upon cellular 
internalization. For example, the filamentous phage coat protein VIII encoding gene 
5 can be mutagenized such that it is encodes a slightly unstable protein and thus allows 
more rapid uncoating and increased transduction capacity. Accordingly, one of 
ordinary skill in the art would readily recognize that given the teachings presented 
herein that mutations may be selected for using reporter gene expression. 

In the various embodiments, when utilizing filamentous phage or 

1 0 another single-stranded DNA vector, transduction may be enhanced by facilitating the 
conversion of the single-stranded DNA to double-stranded DNA. The mechanism for 
conversion of single-stranded phage DNA to double-stranded DNA is analogous to 
that which occurs during infection with single-stranded DNA genome of a parvovirus 
such as an adenoassociated virus (AAV). In the case of recombinant AAV, 

IS conversion to dsDNA is a rate-limiting step for efficient transduction. The E4 orffi 
gene product provided by helper Adenovirus or on a separate expression plasmid 
provides this function and increases transduction efficiency between 100-1000 fold. 
Accordingly, as variety of genotoxic treatments including gamma radiation, UV, heat 
shock, and DNA synthesis inhibitors and topoisomerase inhibitors (e.g., hydroxyurea, 

20 camptothecin, etoposide, and the like, see, e.g., Ferrari et al, 1 Virol 70:3227-3234, 
1996; Alexander et al., J. Virol 65:8282-8287, 1994; Russell et al, PNAS 92:5719- 
5723, 1995) will increase rAAV transduction efficiency in the absence of infection by 
helper virus, similar treatments can also increase the transduction efficiency f 
recombinant phage vectors and thus may be utilized in the practice of the present 

25 invention. 

In other embodiments, peptides or other moieties that allow or 
promote the escape of the vectors (and any molecule attached thereto or enclosed 
therein) from the endosome are incorporated and expressed on or attached to the 
surface of the of the ligand displaying genetic package (e.g., bacteriophage). Such 

■ 

30 "other moieties" include molecules that are not themselves peptides but which have 
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the ability to disrupt the endosomal membrane, thereby facilitating the escape of the 
vector, and molecules that otherwise mimic the endosomal escape properties of the 
within-described peptide sequences (see. e.g., published PCT Application No. WO 
96/10038, the disclosure of which is incorporated by reference herein). 

Peptide sequences that confer the ability to escape the endosome are 
particularly preferred. Such sequences are well known and can be readily fused or 
conjugated covalently or genetically to a coat protein, such as gene III or gene VIII of 
filamentous phage. Although fusion of one or more peptide sequences to a coat 
protein is described herein as a preferred embodiment, it should be understood that 
other methods of attachment — and other moieties besides peptides — are useful as 
well. Further, as those of skill in the art can readily appreciate, the present invention 
may be utilized to screen for peptide sequences that enhance transduction via 
endosomal escape or nuclear targeting when combined with ligand targeting agents. 
Accordingly, dual display peptides may be utilized for such screening, wherein one 
coat fusion or conjugate represents a known ligand and the other coat protein fusion 
or conjugate represents the sequence to be screened. 

As with single display vectors, novel peptides or variants may be 
selected through rounds of cell contact and recovery of cells expressing the reporter 
gene, followed by identification of the encoded ligand. 

In yet other embodiments, combinations of elements that facilitate 
transduction may be utilized. For example a combination of UV and heat shock or 
heat shock and an endosomal escape peptide or nuclear targeting peptide may be 
utilized. 

In yet another embodiment, cell attachment moieties (e.g., peptides) 
may be chemically conjugated, non-covalent, (e.g., electrostatic, antibody-antigen, 
biotin-streptavidin, etc.) or genetically fused to the exterior of the ligand display 
package. For example, many animal viruses encode sequences for both general cell 
binding and sequences specific for internalization. In this regard, adenovirus particles 
use both the knob protein (for receptor binding) and integrin binding for 
internalization. Highly charged proteins such as polylysine also facilitate binding to 
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the negatively charged cell membrane, but may also result in undesirable non-specific 
transduction of any cell. Thus, when non-specific transduction is sought highly 
charged peptides such as polylysine, histone, etc. may be used. However, in the 
practice of the present invention preferred peptides are those that will bind to cells at 
5 low affinity and facilitate internalization only in the presence of a second displayed 
ligand (e.g., EGF). Examples include L-selectin, the sequences encoding known 
heparin binding peptides (e.g., residues 65-81 from FGF2 which have been 
implicated in heparin binding (lmamura et al. t Biochem. Biophys. Acta 7266:124-130, 
1995)) and the heparin binding sequences identified in angio-associated migratory 
10 cell protein (Beckner et al.. Cancer Res. 55:2140-2149, 1995) thai fit a heparin 
binding consensus RRXRRX (Cardin and Weintraub, Arteriosclerosis 9:21-32, 1995) 
and RGD containing sequences (Pierschbacher and Ruoslahti, Nature 509:30-33, 
1 984) that bind cell surface integrins. 

As one of skill in the art can readily appreciate the selecti n 
15 methodologies described herein may be utilized to scre en for additional cell 
a tt ach men t peptides. Briefly, cell attachment peptides may be conjugated or 
genetically fused to the exterior of a ligand display package. Typically, the cell 
attachment moiety will be attached to a display package such as a dual display phage 
such that an internalizing ligand and the cell attachment moiety will be jointly 
20 displayed. Accordingly, such joint display allows one of skill in the art to detect a 
change in transduction efficiency between packages displaying internalizing ligands 
alone and co-display in the presence of a cell attachment moiety (e.g., peptide). 

In one example, to allow independent binding of the EGF-pIII fusion 
protein and the accessory cell attachment peptide, the cell attachment peptides are 
25 fused or conjugated to the phage major coat protein, pVlII (a small protein that makes 
up the tubular protein sheath that encapsulates the phage genome of which there are 
about 2700 copies per particle). Further, it has been established that pVIII can 
tolerate the addition of small peptides on its N-terminus (Ilyichev et aL FEBS Lett 
507:322-324, 1992; Makowski Gene 725:5-11, 1993) which is used to produce the 
30 so-called "landscape phage" (Petrenko et a!.. Protein Eng. 9:797-801, 1999). Small 
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peptides (- 6-8aa) may be fused directly to the N-terminus or made as substitutions in 
the middle of pVIII as described by Petrenko (supra). For larger peptides or where 
more than one peptide is to be inserted into or conjugated to the coat, a wild type 
pVIII protein may be included in the phage system as described for the dual display 
5 phagemid. Once sequences are found that enhance transduction efficiency in a 
representative cell line such as COS1 or other cells, the sequences may be further 
op timiz e d by mutation and further selection by the ligand identification methods 
described herein. 

As noted above, dual display vectors are useful within the context of 
1 0 the present invention. In dual display embodiments additional elements displayed on 
a coat protein (e.g., pUl or pVTII) such as an endosomal escape sequence or random 
peptide sequence may be incorporated into or conjugated to the phage particle to 
enhance phage mediated transduction or to test for enhanced transduction. These 
elements enhance transduction by facilitating cell binding, endosomal escape, nuclear 

IS localization, and other points along die transduction pathway that are currently rate 
limit i ng . Accordingly, it would be advantageous to display separate elements on pin 
and pVm (or analogous exterior proteins) such that for example, the targeting ligand 
is expressed on pill and the endosomal escape sequence on pVIII. Such separation 
may also m i n i mi ze the possibility of interference of the function of one element with 

20 the other. Thus, a phagemid vector can be constructed that allows fusion and display 
of distinct peptides or proteins on both pill and pVIII. For example, in one 
embodiment, only one pID gene (the fusion) may be present when phage rescue is 
performed with pill deleted helper phage. However, 2 pVIII genes are present: one on 
the phagemid (the pVIII fusion) and one (wild-type) on the helper phage. Thus the 

25 vector is designed to display a peptide or protein ligand on pill and additional 
accessory peptide(s) or protein(s) on pVIII as a mosaic with wild type pVIII. Further, 
as there may exist an upper limit to displaying peptides on pVIII because peptides 
larger than about 8 residues interfere with assembly of the phage particle (Ilyichev et 
al (supra); Makowski, (supra)), displaying the accessory protein as a mosaic allows 
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for proper particle assembly and the display of peptides larger than 6-8 residues on 
the major coat protein, pVm. 

Thus, an example of a dual display filamentous phage presents a 
ligand (e.g., FGF) as a fusion to gene III and an endosomal escape peptide fused to 
gene VIII. The locations of the ligand and escape sequences are interchangeable. 
Escape sequences that are suitable include, without limitation, the following 
exemplary sequences: a peptide of Pseudomonas exotoxin (Donnelly, J J., et al., 
PNAS 90:3530-3534, 1 993); influenza peptides such as the HA peptide and peptides 
derived therefrom, such as peptide FPI3; Sendai Virus fusogenic peptide; the 
fusogenic sequence from HTV gpl protein; Paradaxin fusogenic peptide; and Melittin 
fusogenic peptide (see WO 96/4 1 606). 

Another sequence that may be included in a vector is a sequence that 
facilitates trafficking proteins into the nucleus. These so-called nuclear translocation 
or nuclear localization sequences (NLS) are generally rich in positively charged 
amino acids. Because the carboxyl terminus of gene VIII protein of filamentous 
phage already carries a positive charge, increased charge and likeliness of nuclear 
transport may be enhanced by fusing known mammalian cell NLS sequences to the 
gene VIII protein. NLS fusions to other coat proteins of filamentous phage may be 
substituted. 

Examples of NLS sequences include those resembling the short basic 
NLS of the SV40 T antigen; the bipartite NLS of nucleoplasmin; the 
ribonucleoprotein sequence Al; the small nuclear ribonucleoprotein sequence U1A* 
and human T-lymphocyte virus- 1 Tax protein. Other useful NLS sequences include 
the HTV matrix protein NLS; and the nuclear translocation components 
importain/hSRPl and Ran/TC4; the consensus sequence KXX(K/R) flanked by Pro 
or Ala; the nuclear translocation sequence of nucleoplasmin; or the NLS from 
antennapedia (see WO 96/41606). 

Further, sequences which direct the genetic package to various cellular 
compartments may be useful within the context of the present invention. For 
example, while FGF appears to be trafficked to the nucleus via a nuclear localization- 
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like peptide, EGF appears to be trafficked through the lysosome. Accordingly, in 
addition to the putative ligand, a lysosomal directing sequence may be incorporated 
into one of the coat proteins of the genetic package. Exemplary sequences in this 
regard are KCPL which acts as a lysosomal targeting sequence (Blagoveshchenskaya 
5 et aL, J. Biol. Chem. 275^:27896-27903, 1998), the ubiquitin-dependent 
endocytosis motif DSWVEFIELD (Govers et aL, EMBO J. 7*<7;:28-36, 1999, and 
DQRDLI or EQLPML from MCHII which also target the lysosome (Kang et aL J. 
Biol Chem. 273(32)20644-20652, 1998). 

As described herein, the library is then propagated in the display phage 
10 by transfection of a suitable bacteria host (e.g., DH5aF for filamentous phages), and 
growing the culture, with the addition of a replication-competent helper virus (for 
phagemid vectors) if necessary, overnight at 37°C. The phage particles are isolated 
from the culture medium using standard protocols. 

Infection of mammalian cells with phage is performed under 
15 conditions that block entry of wild type phage into cells (Barry et aL, Nature 
Med 2.299-305, 1996). Phage are added directly to cells, typically at titers of £10 12 
CFU/ml in a buffer, such as PBS with 0.1% BSA or other suitable blocking agents, 
and allowed to incubate with the cells at 37°C or on ice. The amount of phage added 
to cells will depend in part upon the complexity of the library. For example, a phage 
20 display library containing 1 0 5 members has each member represented 10 6 times in 1 
ml of a typical phage titer of 10 n colony forming units/ml. 

D. DETECTION/SELECTION OF TRANSGENE EXPRESSION 

The genetic package display library is ultimately screened against the 
25 target tissue or cell line. Screening can be performed in vitro or in vivo. While 
combinatorial screening methods have been performed in the past, these methods are 
unable to determine the transduction capability of the displayed ligand (see. U.S. 
Patent No. 5,733,731, incorporated herein by reference). The criteria for a positive 
"hit" in the present invention is that the phage must be able to bind, be internalized, 
30 and enable detection of the internalized ligand by detecting a selectable marker, such 
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as, for example, by expressing the phage genomic DNA containing the 
reporter/selectable gene in the target or test cell or allowing direct nucleic acid 
detection (e.g., PCR or DNA binding). In this regard, while not wishing to be bound 
to a particular theory, it is believed that the phage should bind, internalize, uncoat, 
5 translocate to the nucleus, and replicate, in order to express the gene or otherwise 
facilitate detection (however, translocation and uncoating may occur in any order). 
Thus, in preferred embodiments only sequences that reach the nucleus are selected. 

The test cells may be any cells that express a receptor of choice or may 
be a cell type or source for which gene therapy is destined. Thus, in some instances, 
10 the receptor may be unknown. In such cases, the selection method can be used to 
isolate a ligand for a receptor without a known ligand (orphan receptor) such as crbB3 
or similar orphan receptor. Briefly, the orphan receptor is cloned into a mammalian 
expression vector that also contains a selectable drug resistance gene and transfected 
into m a mm al i a n cells, such as COS cells. Stable transfectants that overproduce the 
15 orphan receptor are selected by cultivation in the appropriate drug. This receptor- 
transformed COS cell line is then used as the cell line for selection of ligand- 
displaying phage. 

In one aspect of the present invention, natural ligands (i.e., the 
naturally occurring ligand for a cell surface protein that internalize). In one 

20 embodiment a library of cDNAs that is representative of the sequences encoding all 
the protein domains made by a cell type or tissue are utilized. Briefly, the library is 
constructed using random oligonucleotide primers that have extensions encoding 
restriction enzyme recognition sites such that the final cDNA products can be cut 
with the restriction enzymes and ligated into an appropriate phage vector (e.g., MG4, 

25 pUC-MG4). For example, the first strand cDNA synthesis is primed with a random 
6-mer oligonucleotide containing a Pstl restriction site extension. The second strand 
synthesis is performed using a random 6-mer oligonucleotide extended with a Ncol 
restriction endonuclease. In this manner the cDNA fragments are cloned into the 
Ncol/Pstl sites in a vector in the 5' to 3* direction. Thus only 3 possible reading 

30 frames out of 6 (if read off both coding and complementary strands) are inserted into 
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the vector and one or more of these orfs encodes a protein that is present in the cell. 
The resulting cDNA fragments may be size selected to obtain a population of a 
preferred size ranging from 30 to 1000 nucleotides. The preferred library is 
normalized to remove highly redundant sequences and increase the probability of 
5 selecting protein domains encoded by rare mRNAs. For example, nonnalizati n 
could be performed using subtractive hybridization to remove repetitive sequences 
(high Cot), (see e.g., Bonaldo et aL Genome Res. 6:79 1-806, 1996). An alternative 
to random cDNA synthesis is to make a library of DNAse fragmented cDNAs (from a 
library or from individual cDNAs) or using methods described by Roninson for the 
10 generation of GSEs (Gudkov et al. t Proc. Natl. Acad. Sci USA P7:3744-3748, 1994) 
and others (Fehrsen and du Plessis, Immunotechnology 4: 175- 184, 1999; Petersen et 
aL. Mol. Gen. Genet. 249:425-431, 1995; Kay (supra)). 

Following vector construction and phage production the phage cDNA 
library is contacted with cells or tissues that display the receptor for which the 
15 cognate ligand is sought and the ligand selected using the selection strategies 
described herein. The cell line may express the natural receptor or be engineered to 
overexpress a recombinant receptor gene that is stabley expressed in that cell line 
using standard recombinant DNA methods. At 72 to 96 hours after phage addition 
the cells are harvested and the reporter gene (e.g. GFP) positive cells are collected. 
20 The sequences encoding the ligand are recovered by PCR or Hirt supernatant and the 
phage reconstituted and used as input phage for the next round or selection. The 
library is monitored at each round of selection by restriction enzyme analysis of 
phage DNA to determine if the complexity of the library has been sufficiently 
reduced to allow identification of one or more active ligands. The ligand encoding 
25 sequences are compared to databases of known genes to determine if the recovered 
sequence is a portion of a known gene. Alternatively, the ligand encoding cDNAs are 
used as probes of conventional cDNA libraries to identify the full-length gene using 
standard molecular biology protocols for identification of full-length cDNA from 
partial sequences. 



BNSDOCtD- «WO_0089555At I. > 



PCT/US99/2536] 



WO 00/29555 



PCT/US99/25361 



30 

The feasibility of the aforementioned approach can be evidenced by 
using the cDNA encoding the proopiomelanocortin (POMC) gene (which encodes six 
different peptide ligands) is fragmented at random by DNA 1, ligated to the 
appropriate linker adaptors (see Chapter 9 by Du Plessis and Jordaan in Kay et aL 9 
5 1998) and inserted into an appropriate phage or phagemid. The resulting library 
contains a mixture of DNA fragments, some of which will be inserted in-frame (1/3 if 
cloning is directional and 1/6 if cloning is bi-directional) with the pill coat protein of 
the phage vector. The phages that display fragments encoding one of the six peptide 
ligands or a functional fragment thereof will bind and internalize in cells that display 

10 the appropriate receptor. The library is selected against cell lines that naturally 
express one of the receptors or against cells that are engineered to overexpress the 
receptor. For example, the POMC fragmented gene library is selected against COS 
cells that are made to overexpress the melanocortinl receptor. Accordingly, the 
cDNA fragments that are selected encode the MSH-oc peptide contained within the 

15 POMC cDNA. A series of overlapping cDNAs selected in this manner will define 
the minimal sequence that is sufficient to functionally interact with the melanocortinl 
receptor. Thus, the minimal functional peptide is defined in a manner analogous to 
the identification of minimal epitopes for antibody binding using methods described 
by Geyson eg al (J. Immunol Nfeth 702:259-274, 1987). Similarly, the library may 

20 be selected against COS cells that overexpress ACTH or ^-endorphin receptor to 
identify cDN As encoding their cognate receptors. 

In further embodiments, due to post-translational modifications the 
selection of natural ligands for mammalian cells may be enhanced by utilizing ligands 
produced in mammalian cells and conjugating such ligands to iigand display package 

25 of interest. Briefly, the cDNAs are expressed in mammalian cells thereby allowing 
post-translational processing and modification that takes place in these cells. In this 
regard, the phages may be conjugated to the mammalian cell synthesized library by 
non-covalent (e.g., electrostatic, etc.) linkage (e.g., an antibody-antigen epitope 
interaction, streptavidin-biotin, etc.) or by covalent means. The total cDNA is 

30 subcloned into a mammalian expressi n vector such as pSecTag2 (available from 
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Invitrogen. CA), that is designed to tag each cDNA gene product with a binding site 
for an antibody or other site specific protein-binding sequence (e.g., IgG binding 
domain). As many cDNA molecules may contain stop codons. an additional tag may 
be added downstream of the secretion signal, but upstream of the cDNA insert (e.g., 
5 HA etc.). The cells are transfected with an epitope tagged library and distributed into 
individual or pools of transfected cells. The conditioned media is removed from the 
cells, phage that display an epitope binding antibody fragment or other epitope 
binding moiety are added to the medium (at about 10" pfu/ml), and the mixture is 
added to fresh target cells. The phages bind the ligands via the displayed antibody 
10 and the epitope tag on the ligand. Those ligands that are internalized by the target 
cells direct phage-mediated transduction of the target cells. Positive transduction is 
detected by GFP fluorescence, drug selection, or any reasonable detection means. 
The cells that secrete functional ligands are identified as those from which 
transduction competent conditioned medium was drawn and the positive cDNAs are 
15 recovered by PCR. In the case of pools of cDNAs the pools are deconvolved to 
identify individual cDNAs. 

In yet additional embodiments, tissue-specific or tumor-specific 
ligands can be selected by pre-absorption of the phage library against nonnal or non- 
targeted tissues of cell cultures. The selection process can also be applied in vivo by 
20 injecting the library into tumor-bearing mice. The tumor is removed from the mouse 
48-72h after injection. A cell suspension is prepared and phage genome bearing cells 
selected by one of the methods described herein. The gene whose product allows 
entry and expression of the phage genome is then isolated from the drug resistant cell 



25 Screening may be performed directly against the target cells with no 

pre-screening or pre-enrichment. In one aspect, the present invention provides a 
method of identifying target cells or tissues for known or putative ligands. In this 
regard, phage display may be used to display a library of known or putative ligands 
(e.g., peptides, antibody fragments and the like) and screen singular tissues or cell 

30 types, or pools of tissues or cell types, thereby identifying target cells or tissues which 
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are effectively transduced by a ligand. As used herein, "poor 1 refers to two or more 
cell types or tissue types. In one embodiment, known ligands are presented on a 
ligand displaying genetic package to a pool of a variety of cell or tissue types and 
transgene expression is monitored. In a further embodiment, putative ligands are 
5 used to screen a pool of a variety of cell or tissue types for transduction ability. In 
this regard ligands may be recovered and identified which efficiently transduce a 
particular tissue or cell type. Identification of cell specific ligands could greatly 
improve existing vectors for therapeutic gene delivery by targeting specific cells thus 
reducing toxicity and allowing vectors to be administered systemically. 

10 Such cell type or tissue-type screening provides for selection that 

requires biological interaction rather than simple binding and does not require 
recovery of infective phage. In addition, cell surface receptors need not be identified 
and purified for the screening to be effective. A further aspect of the present 
invention is that it can be easily adapted to high throughput applications for screening 

IS a variety of cell types or tissues and/or for screening libraries of putative ligands 
against libraries of putative receptors/binding partners (i.e.. anti-ligands) which lead 
to transgene expression (see infra). In this regard, screening of a variety f 
ligand/cell interactions could be performed, including, for example, pathogen/host 
interactions, ligand/receptor, etc. 

20 In one aspect, the present invention may be utilized to identify a 

variety of protein-protein interactions. In particular, a set of unknown 
proteins/peptides may be selected based upon interaction with another set of known 
or unknown proteins/peptides (e.g., random peptides, cDNA libraries, or antibody 
gene libraries). In one embodiment, putative ligands are displayed on the surface f 

25 filamentous phage that carry a reporter gene. These display phage are contacted with 
a cell line displaying a putative anti-ligand (protein/peptide) on its surface as a 
receptor fusion protein, such that binding of successful detection of the reporter gene 
requires binding of the phage display ligand and the cell surface displayed anti- 
ligand, as well as internalization and transgene expression. Such screening can be 

30 utilized in a variety of methods, for example, a known ligand may be screened against 
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a library of potential anti-Iigands, a library of unknown Iigands may be screened 
against a known protein/peptide anti-ligand, and two libraries of peptides/proteins 
may be screened against each other to identify ligand/anti-ligand interactions 
(protein-protein). 

5 A ligand/anti-ligand pair refers to a complementary/anti- 

complementary set of molecules that demonstrate specific binding, generally of 
relatively high affinity. Exemplary ligand/anti-ligand pairs include an antibody and 
its ligand as well as ligand/receptor binding. While it should be understood that the 
designation of either component of the above mentioned ligand/anti-ligand pairs as 

10 either a ligand or anti-ligand is arbitrary, when necessary to specify a particular 
component, a "ligand", as used herein, is meant to describe peptides or proteins 
displayed on a genetic package carrying an expressible transgene. Further, when 
necessary to define anti-ligand with specificity, an "anti-ligand", as used herein, 
demonstrates high affinity and is expressed on the surface of the target cell to be 

1 5 monitored for transgene expression. 

Any cell surface receptor may be used as the fusion construct for the 
cell surface displayed anti-ligand. However, in a preferred embodiment, the 
extracellular domain of the receptor is replaced with the putative anti-ligand. 
Construction of such fusions is routine in the art given that sequences as well as the 

20 extracellular intracellular domains of numerous receptors are known and available in 
the art. Komesli et aL, Eur. J. Biochem 254(3):505-513, 1998; Naranda et aL. Proc. 
Natl. Acad Set USA 94{2\):\ 1692-1 1697, 1997; Rutledge et aL J. Biol. Chem. 
266(31):21 125-21 130, 1991; Lemmon et aL. Embo J. 76(2)281-294, 1997: Foehr et 
aL, Immunol. Cell Biol 76(5):406-413, 1998. Exemplary fusion constructs include, 

25 for example, anti-ligand-FGF receptor or anti-ligand-EGF receptor constructs. 

In a further embodiment, a large pool of cDNAs may be tested by 
transfecting into a large number of mammalian cells (e.g., COS cells). Ligand 
displaying phage are exposed to the transfected cells and positive cells identified by 
either drug selection or detection of an expressed transgene (e.g., GFP sorted by 

30 FACs). PCR may be performed n single cells to identify ligand/anti-ligand 
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pairs. In this regard PCR primers directed to the known portion of the fusion 
construct may be used. For example, for phage display using pill to display the 
iigand, the PCR primer will be directed to the pill gene, while in order to identify the 
anti-ligand, the PCR primer will be directed to the surface membrane protein (e.g., a 
5 receptor domain) encoding portion of the fusion construct. Alternatively, the 
plasmids within positive cells may be rescued by Hirt supernatant method and 
separated from phage DNA by gel electrophoresis or chromatography. (Kay et aL % 
supra). The selected cDNA plasmids may then be used to retransform bacteria. New 
plasmid DNA is prepared and used for additional rounds of screening by transfection 

10 into the cells and phage contact 

In an alternative embodiment, detection may be by any means which 
allows for the detection of the internalized nucleic acid molecules, and may include 
Hirt extraction of small DNA, direct polymerase chain reaction (PCR) amplificati n 
of Iigand DNA from reporter gene expressing cells and non-endogenous protein- 

IS nucleic acid molecule binding interactions (e.g., lac operon and lac repressor in a 
mammalian cell) in positive cells. In addition, it is possible to use direct PCR 
amplification of Iigand DNA from cells wherein no reporter gene is used. 

In the various embodiments of the present invention utilizing PCR 
amplification of Iigand sequences, the methodologies allow for the rapid 

20 amplification of only internalized sequences. Typical phage display technologies 
require that the phage of interest (e.g., that which binds to a particular target) be 
eluted and amplified following transduction of the appropriate host bacterial strain. 
However, transduction of bacteria requires that bacteriophage are intact and maintain 
infectivity. To eliminate the requirement for infective phage, methodologies 

25 provided herein allow those of ordinary skill in the art to recover, by PCR, phage 
DNA sequences that have been trafficked to the nucleus, digest these sequences with 
appropriate restriction enzymes, remove extraneous sequences, subclone the desired 
sequences back into the phage display vector, and transform bacteria with this vector. 
Accordingly, by not requiring that the recovered phage be infective, the ability to 

30 display larger ligands as fusion constructs with coat proteins is possible. Further, 
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recovery of uncoated phage, such as those targeted to the lysosomal or endosoma] 
compartments, as well as those capable of directing expression in the nucleus is 
possible. 

Briefly, in one aspect, the recovery by PCR and amplification 
proceeds as follows: An initial selection of cells is performed using the detection of a 
reporter gene, selective conditions, or the like and the total DNA is recovered from 
these cells. The recovered DNA is used as a template for PCR primers that are 
designed to flank the sequence of interest (the ligand encoding sequence, i.e., by 
using the pill orpVIII sequences flanking the ligand insert as primer templates). The 
primers can be manufactured such that they can be easily removed from the ligand 
insert following restriction enzyme digestion, for example, the primers may contain a 
biotin moiety at the 5*-end. In the alternative, the primers may be removed by any 
known methods, including, for example, gel extraction, selective precipitation, and 
the like. In other alternatives primers need not be removed, however their removal 
facilitates ligation efficiency of ligand insert to vector. Further, the primers can 
provide restriction sites for subcloning etc. 

Following amplification, the PCR product is purified to remove the 
polymerase and digested with restriction enzyme to excise the putative ligand insert 
Enzymes are chosen in order to facilitate directional subcloning into either the 
20 original vector or a new construct, if so desired. Following enzyme digestion, the 
extraneous sequences are removed (e.g., by using biotinylated primers and 
streptavidin conjugated to beads or other solid support). The resulting DNA 
sequences are then ligated into the desired vector and the resulting vector is 
transformed into bacteria using standard methodologies. The transformed bacteria 
25 are then used to generate new phage particles or new DNA for additional rounds f 
screening. However, while it should be understood that the use of a reporter gene 
may lead to enhanced DNA recovery and fewer rounds of screening, there is not 
always a requirement that a reporter gene be used. 

In an alternative embodiment, recovery of replicated internalized 
30 nucleic acid molecules may be achieved via a nucleic acid binding domain. 
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Accordingly, when using phage, the phage genome can be altered such that a DNA 
binding sequence is incorporated therein. In one example, the phage vector may 
contain one or more copies of the lac operon, thereby allowing any internalized and 
replicated phage vectors to be purified from a cell lysate by a solid surface having 
5 conjugated thereto the lac repressor protein. Briefly, target cells are contacted with 
ligand displaying genetic packages (e.g., phage) for 48 to 72 hours. Since only the 
packages displaying an appropriate ligand are internalized and reach the cell nucleus 
where vector replication takes place, these will be the sequences that will be selected 
for and feus, no reporter gene is required (e.g., GFP). Accordingly, the replicated 

10 vector is double stranded and the double stranded form of the lac operon will bind the 
lac repressor. The cells are then lysed and nuclear extracts are prepared, which are 
then passed over a solid support (e.g., Sepharose 4B) having conjugated thereto the 
lac repressor protein. The column is washed, then eluted by a salt or pH gradient, 
thereby releasing the bound DNA which can now be utilized in PCR reactions to 

15 amplify the ligand sequences for sub-cloning into another vector for further rounds f 
infection or characterization or the DNA can be used directly (without PCR) to 
transform bacteria and thereby produce more phage for further screening. 

In other embodiments, the ligand displaying genetic package may also 
contain the nucleic acid sequences that encode the nucleic acid binding protein. For 

20 example, in the illustration above, the vector could also encode the lac repressor and 
the solid support has an anti-lac repressor antibody conjugated thereto, thereby 
allowing for recovery of nucleic acid molecules bound by the lac repressor. 

One of ordinary skill in the an would readily recognize that a variety 
of nucleic acid binding proteins could be utilized as described above. In this regard, 

25 many proteins have been identified that bind specific sequences of DNA. These 
proteins are responsible for genome replication, transcription and repair of damaged 
DNA. The transcription factors regulate gene expression and are a diverse group of 
proteins. These factors are especially well suited for purposes of the subject 
invention because of their sequence-specific recognition. Host transcription factors 

30 have been grouped into seven well-established classes based upon the structural motif 
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used for recognition. The major families include helix-tuni-helix (HTH) proteins, 
homeodomains, zinc finger proteins, steroid receptors, leucine zipper proteins, the 
helix-loop-helix (HLH) proteins, and ^-sheets. Other classes or subclasses may 
eventually be delineated as more factors are discovered and defined. Proteins from 
5 those classes or proteins that do not fit within one of these classes but bind nucleic 
acid in a sequence-specific manner, such as SV40 T antigen and p53 may also be 
used. 

These families of transcription factors are generally well-known (see 
GenBank; Pabo and Salter, Ann. Rev. Biochem. 67:1053-1095, 1992; and references 

10 below). Many of these factors are cloned and the precise DNA-binding region 
delineated in certain instances. When the sequence of the DNA-binding domain is 
known, a gene encoding it may be synthesized if the region is short. Alternatively, 
the genes may be cloned from the host genomic libraries or from cDNA libraries 
using oligonucleotides as probes or from genomic DNA or cDNA by polymerase 

15 chain reaction methods. Such methods may be found in Sambrook era/., supra. 

Helix-turn-helix proteins include the well studied k Cro protein, Xcl, 
and E. coli CAP proteins (see Steitz et al. 9 Proc. Natl Acad. Sci. USA 79:3097-3100, 
1982; Ohlendorf et al., J. Mol Biol J 6P:757-769, 1983). In addition, the lac 
repressor (Kaptein et al., J. Mol Biol J 82: 1 79-1 82, 1 985) and Tip repressor (Scheritz 
• 20 et al., Nature 577:782-786, 1985) belong to this family. Members of the 
homeodomain family include the Drosophila protein Antennapaedia (Qian et al , Cell 
5P:573-580, 1989) and yeast MATa2 (Wolberger et al, Cell 67:517-528, 1991). 
Zinc finger proteins include TFI11A (Miller et al, EMBO J. 4:1609-1614, 1985), 
Sp-1, zif 268, and many others (see generally Krizek et al, J. Am. Chem. Soc. 

25 773:4518-4523, 1991). Steroid receptor proteins include receptors for steroid 
hormones, retinoids, vitamin D, thyroid hormones, as well as other compounds. 
Specific examples include retinoic acid, knirps, progesterone, androgen, 
glucocosteroid and estrogen receptor proteins. The leucine zipper family was defined 
by a heptad repeat of leucines over a region of 30 to 40 residues. Specific members 

30 of this family include C/EBP, c-fos, c-jun, GCN4, sis-A, and CREB (see generally 
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O'Shea et al, Science 254:539-544, 1991). The helix-loop-helix (HLH) family of 
proteins appears to have some similarities to the leucine zipper family. Well-known 
members of this family include myoD (Weintraub et al % Science 257:761-766, 1991); 
c-myc; and AP-2 (Williams and Tijan, Science 257:1067-1071, 1991). The p-sheet 
5 family uses an antiparallel 0-sheet for DNA binding, rather than the more common 
a-helix. The family contains the MetJ (Phillips, Curr. Opin. Struc. Bioi 7:89-98, 
1991), Arc (Breg et a/., Nature 346:586-589, 1990) and Mnt repressors. In addition, 
other motifs are used for DNA binding, such as the cystcine-rich motif in yeast GAL4 
repressor, and the GATA factor. Viruses also contain gene products that bind 
10 specific sequences. One of the most-studied such viral genes is the rev gene fn 
HTV. The rev gene product binds a sequence called RRE (rev responsive element) 
found in the env gene. Other proteins or peptides that bind DNA may be discovered 
on the basis of sequence similarity to the known classes or functionally by selection. 
Furthermore, those of ordinary skill in the art will appreciate that the nucleic acid 
15 binding domain will chosen for a particular recovery method will preferably be one 
which is not already present within the target cells. 

In a further embodiment, known or putative ligand-display phage may 
be used to screen a panel of cells that each express a potential target receptor. The 
source of the target receptor may be a known (Le. cloned) receptor cDNA, or a 
20 collection of putative receptor cDNAs. For example, the putative receptor cDNAs 
may be identified from an epitope-tagged cDNA library as cDNAs that encode 
proteins that appear on the surface of cells, (see, Sloan et al. y Protein Expression and 
Purification 77:119-124, 1997). Such cDNAs are inserted into an appropriate 
m a mm a li a n expression vector and transfected into a host cell. Preferably the host cell 
25 is eukaryotic, and more preferably the host cell is mammalian. The expression of the 
cDNA may be either stable or transient. Following expression the cells are contacted 
with the ligand-display phage and monitored for transgene expression (e.g., drug 
resistance, GFP, or other detectable product). One skilled in the art would recognize 
that identification of cell or tissue types as described above, in addition to using 
30 ligand display phage, could also performed by utilizing other ligand displaying 
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means, such as RNA-peptide fusions as described by Roberts and Szostak (Proc. Nat. 
Acad ScL USA 94: 12297- 123 02, 1 997), other phage types, viruses, or on bacteria. 

Pre-screening or pre-enrichment may be used and can be especially 
helpful when either too few or too many hits are observed. Enrichment for cell 
5 binding may improve detectability if no hits are found in the initial screen. A 
prescreen to remove phage that bind non-specific cells surface proteins may reduce 
non-specific hits if there are too many initial hits. For example, infection of 1 0 7 target 
cells is performed with about 10 n phage, however a variety of cell density and phage 
titer ranges are useful. The cells are incubated for at least 2 hours and preferably 24- 
10 48 hours in PBS/BSA and washed extensively (Barry et aL 9 Nature Med 2:299-305, 
1996). The cells are incubated in media at 37°C for 24-96 hours and then detected or 
selected on the basis of expression of the reporter gene. 

Assays for each of these reporter gene products are well known. F r 
example, GFP is de t e ct ed by fluorescence microscopy or flow cytometry, SEAP is 
15 detected in medium using a fluorescent substrate (Clontech; Palo Alto, CA), human 
growth hormone may be detected in medium by a simple and sensitive radioimmune 
assay (Nichols Institute; CA). Western blotting and EUSA may also be used to 
immunologically detect and measure the presence of reporter gene product. 
Alternatively, the message for the reporter gene is detected using RNase probe 
20 protection or fluorescent probe hybridization. For isolation of the phage vector DNA 
and insert, any technique that can identify and isolate the cells expressing detectable 
marker product may be used. Flow cytometry, in particular, is well suited for 
detecting fluorescence in or on a cell and isolating that cell. Further, flow cytometry 
is well suited for high throughput methodologies when necessary to isolate individual 
25 cells or groups of cells that express a reporter gene. 

When the reporter gene is a selectable marker, the cells are grown in 
selective conditions. Depending upon the marker, the conditions may be a particular 
growth temperature, addition of a drug, or the like. In the examples provided herein, 
the selectable marker is neomycin transferase, which confers G418 resistance on 
30 m a mm a l ian cells. Briefly, the cells are grown in the presence of C41 8 for 7-14 days 
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or until resistant colonies are visible microscopically. Colonies are picked, and phage 
vector DNA recovered, conveniently as amplification of the insert or Hin 
supernatent. 

Alternatively, multiple rounds of infection and selection are performed 
5 to reduce the complexity of the infecting phages. For example, drug-resistant 
colonies are pooled and the selected inserts amplified and cloned back into the phage 
display vector for a new round of infection. When the reporter is fluorescent, flow 
cytometry can be used to select the strongest fluorescing cells to select the most 
highly efficient gene delivery ligands. More stringent screening conditions also 
10 include higher selective drug concentrations. At the completion of a selection 
process, representative phage clones may be subjected to DNA sequence analysis to 
further characterize gene delivery ligands. 

In a further aspect, high throughput screening methodologies, such as 
screening libraries by sub-selection of pools, may be utilized to identify ligands. 

15 Briefly, phage stocks containing a variety of members, as individual plaques, may be 
used in combination with an array to identify potential internalizing ligands. F r 
example, a stock of bacteriophages containing library members may be divided into 
subset pool stocks such that each stock contains about 10 2 to about 10 3 members. 
Each stock solution is then screened utilizing an array (e.g. , multi-well plates 

20 containing target cells). Upon detection of a reporter gene the phage stock may be 
sub-divided again and screened repeatedly until the phage which contains the 
internalizing ligand is identified. Alternatively, those of skill in the art will appreciate 
that the array may contain a variety of cell types which are capable of being screened 
with one or more phage libraries, of which may also include a variety of reporter 

25 genes (if so desired). For example, a variety of alternatively colored flu o re s ce nt 
protein expression vectors are available which can be used as reporter genes to 
provide multiplexing capability (Clontech, Palo Alto, CA). Accordingly, rapid 
identification of those cells which internalize the bacteriophage and/or libraries that 
contain internalizing ligands for a specific cell type, may be identified. Utilizing both 

30 a variety of bacteriophage libraries as well as a variety of cell types, would allow for a 
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high throughput method of determining subsets of libraries that contain ligands for 
specific cell types, simultaneously. Array's for binding biomolecules are known in 
the art and therefore could be adapted to utilize the phage screening methodology of 
the present invention, see, e.g., PCT Application No. WO 95/1 1 755. PCT Application 
No. WO 95/35505, U.S. Patent No. 4,591,570. In addition, affinity based biosensors 
such as a Biacore instrument, available commercially from Biacore AB, Uppsula, 
Sweden, may be used to immobilize phage or cells for high throughput screening. 

Moreover, while commonly used high throughput methodologies 
which utilize live cells are typically performed on arrays of 6 to 96 well plates, the 
current invention may also be carried out using cellular micro-arrays such as those 
described by U.S. Patent No. 5,776,748. Briefly, such arrays may be manufactured 
such that designated areas of the array bind a defined number of cells or g jzf of tissue. 
For example, the arrays can be constructed such that they bind only a single cell. 
Therefore, an array of single cells may be constructed with a variety of cell or tissue 
types. Because the size of the cell binding islands on the array may be chosen such 
that no more than one cell may bind on any given island, because the locations and 
geometric pattern of the islands may be predetermined, and because the cells will 
remain at fixed locations during assaying, cellular micro-arrays can provide for a high 
efficiency and high throughput method of assaying for internalizing ligands, anti- 
20 ligands, or target cells or tissues. 

In a preferred embodiment flow cytometry is utilized, the cells are 
identified and counted by an automated detector unit Because the locations and 
geometric patterns of the islands are predetermined, the detector can be designed or 
programmed to take measurements specifically at those locations. Therefore, 

of individual cells which have been successfully transduced by a ligand 
displaying genetic package carrying a nucleic acid molecule which encodes a 
detectable product is easily accomplished. In some embodiments, cells transduced by 
a ligand displaying genetic package carrying a nucleic acid molecule which encodes a 
selectable marker may be first selected on the basis of the appropriate sensitivity or 
resistance and then plated as individual cells and further selected or characterized by 
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the methods described herein. In particular, selection may be employed prior to 
plating on the plates to isolate transformed or transfected cells and then the cells may 
be assayed in situ. 

In addition, when using fluorescence assays, a detector unit may be 
5 placed above the plate or, if the plate is translucent, below the plate. In the case of 
transmission spectrophotometric assays, a translucent plate is used, a source of 
electromagnetic radiation is placed on one side of the plate and a detector unit on the 
other. Because of the small distances between individual isolated cells permitted by 
the present invention, detectors employing fiber optics are particularly preferred. 
10 Such sources of electromagnetic radiation and such detectors for electromagnetic 
transmission, reflection or emission are known in the applicable art and are read ily 
adaptable for use with the invention disclosed herein. 

Screening in vivo may be performed similar to methods for targeting 
organs or xenograft tumors using phage displayed peptides (Pasqualini et aL. Nature 
15 Biotech. 15: 542-546, 1997; Pasqualini et al. 9 Nature 380: 364-366, 1996), except that 
the tissues, organs, or tumors are examined for reporter gene expression instead of the 
presence of phage. Briefly, a phage display library is injected intravenously into 
animals, generally mice, and organs or tumor samples are tested for reporter gene 
function at 48-96 hours after injection. Tumor cells may be cultured in selective 
20 conditions or sorted by flow cytometry or other method to enrich for cells that 
express the phage transducing gene. The ligand encoding sequences can be amplified 
from selected cells as described above. As in in vitro screening, repeated rounds f 
infection and re-screening, alone or in combination with increased screening 
stringency, may be used to obtain the most efficient gene delivery ligands. 
25 Specificity may also be examined in vitro using a panel of non- 

targeted and targeted cell lines and detecting expression of the phage transducing 
gene. Competition studies with free ligand or a neutralizing antibody to the ligand or 
receptor are used to confirm specific entry of phage via the ligand receptor complex. 
Alternatively, the cloned receptor for the ligand can be overexpressed in a cell line 
30 that normally does not express that receptor. Phage internalization and expression 



BNSOOCID «WO. . 0029555A1J > 



WO 00/29555 



43 



PCT/US99/25361 



into the stable transfectants expressing the receptor but not the parent cell line 
indicates the specificity of the ligand for its receptor on receptor bearing cells. 

Ligands that are identified as gene targeting ligands using the selection 
strategies described herein may be further tested for specificity by reporter gene 
5 expression in target and non-target cells and tissues. The ligand may also be tested in 
a variety of gene delivery methods, such as ligand-polylysine/DNA complexes 
(Sosnowski et al., J. Biol. Chem. 272:33647-33653, 1996) or retargeted adenovirus 
gene delivery (Goldman et at., Cancer Research 57:1447-1451, 1997). 

The specificity of the targeting ligand may alternatively be determined 
1 0 in vivo by biodistribution analysis using one of the reporter genes described herein, 
such as luciferase. At various time points, mice injected with the ligand displaying 
phage are sacrificed and tissues examined for the presence of phage in non-targeted 
tissues by immunohistochemistry, an enzymatic assay that detects reporter product 
activity, or the like. 



15 



HI. USES 



The methods described herein are designed to select cDNAs, Fabs, 
sFv, random peptides, and the like for discovery of new ligands or ami -ligands. They 
can also be used to select mutated and gene-shuffled versions of known ligands for 
20 targeting ability. 

These ligands may have increased transduction efficiency (as 
measured by an increase in the percentage of infected cells that express the reporter 
gene); increased expression of the reporter gene (as measured by intensity of reporter 
gene expression) in the phage transduced cells; increased specificity of transduction 
25 for target cells (as measured for ligand specificity); increased stability of the ligand 
(as measured by ability to target the ligand in vivo to tumor cells); increased affinity 
for receptor (e.g., removing dimerization requirements for ligands that dimerize); 
increased functionality (e.g., stimulates internalization); elimination of the need for 
cofactors (e.g., development of an FGF variant that binds with high affinity to the 
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FGF receptor but not to heparin): altered specificity for receptor subtypes (e.g.. an 
FGF variant that reacts with only one of the four FGF receptors). 

The ligands identified by the methods described herein may be used as 
targeting agents for delivering therapeutic agents to cells or tissues. For example, a 
5 therapeutic gene can be incorporated into the phage genome and delivered to cells via 
phage bearing the gene delivery ligand on its protein coaL 

A transducing gene, as used herein, refers to a gene which encodes a 
detectable product in the target cell. Preferentially, the transducing gene is a 
therapeutic gene. A "therapeutic nucleic acid" or "therapeutic gene* describes any 

10 nucleic acid molecule used in the context of the invention that effects a treatment, 
generally by modifying gene transcription, translation, or which supplies a 
replacement or amplification of an existing gene. It includes, but is not limited to, the 
following types of nucleic acids: nucleic acids encoding a protein, ribozyme, 
antisense nucleic acid, DNA intended to form triplex molecules, protein binding 

1 5 nucleic acids, and small nucleotide molecules. As such, the product of the therapeutic 
gene may be DNA or RNA. These gene sequences may be naturally-derived 
sequences or recombinantly derived. A therapeutic nucleic acid may be used to effect 
genetic therapy by serving as a replacement for a defective gene, by encoding a 
therapeutic product, such as TNF, or by encoding a cytotoxic molecule, especially an 

20 enzyme, such as saporin. The therapeutic nucleic acid may encode all or a portion f 
a gene, and may function by recombining with DNA already present in a ceil, thereby 
replacing a defective portion of a gene. It may also encode a portion of a protein and 
exert its effect by virtue of co-suppression of a gene product 

As discussed above, the therapeutic gene is provided in operative 

25 linkage with a selected promoter, and optionally in operative linkage with other 
elements that participate in transcription, translation, localization, stability and the 
like. 

The therapeutic nucleotide composition of the present invention is 
from about 20 base pairs to about 100,000 base pairs in length. Preferably the nucleic 
30 acid molecule is from about 50 base pairs to about 50,000 base pairs in length. More 
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preferably the nucleic acid molecule is from about 50 base pairs to about 1 0,000 base 
pairs in length. Even more preferably, it is a nucleic acid molecule from about 50 
pairs to about 4,000 base pairs in length. 

The ligands/anti-ligands provided herein are useful in the treatment 
5 and prevention of various diseases, syndromes, and hyperproliferative disorders, such 
as restenosis, other smooth muscle cell diseases, tumors, such as melanomas, ovarian 
cancers, neuroblastomas, pterygii, secondary lens clouding, and the like. As used 
herein, "treatment" means any manner in which the symptoms of a condition, disorder 
or disease are ameliorated or otherwise beneficially altered. Treatment also 
10 encompasses any pharmaceutical use of the compositions herein. As used herein, 
"amelioration" of the symptoms of a particular disorder refers to any lessening, 
whether permanent or temporary, lasting or transient, that can be attributed to r 
associated with administration of the composition. 

In certain embodiments, the compositions of the present invention may 
15 be used to treat angiogenesis-dependent diseases. In these diseases, vascular growth 
is excessive or allows unwanted growth of other tissues by providing blood supply. 
These diseases include angiofibroma, arteriovenous malformations, arthritis, 
atherosclerotic plaques, corneal graft neovascularization, delayed wound healing, 
diabetic retinopathy, granulations due to burns, hemangiomas, hemophilic joints, 
20 hypertrophic scars, neovascular glaucoma, nonunion fractures, Osler-weber 
syndrome, psoriasis, pyogenic granuloma, retroiental fibroplasia, scleroderma, solid 
tumors, trachoma, and vascular adhesions. 

By inhibiting vessel formation (angiogenesis), unwanted growth may 
be slowed or halted, thus ameliorating the disease. In a normal vessel, a single layer 
25 of endothelial cells lines the lumen, and growth of the vessel requires proliferation of 
endothelial cells and smooth muscle cells. 

As well, the ligands, anti-ligands, and cells identified by the present 
invention may be used to treat tumors. In these diseases, cell growth is excessive or 
uncontrolled. Tumors suitable for treatment within the context of this invention 
30 include, but are not limited to, breast turn rs, gliomas, melanomas! prostate cancer. 
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hepatomas, sarcomas, lymphomas, leukemias, ovarian tumors, thymomas, 
nephromas, pancreatic cancer, colon cancer, head and neck cancer, stomach cancer, 
lung cancer, mesotheliomas, myeloma, neuroblastoma, retinoblastoma, cervical 
cancer, uterine cancer, and squamous cell carcinoma of skin. For such treatments, 
5 ligands are chosen to bind to cell surface receptors that are generally preferentially 
expressed in tumors. 

Through delivery of the compositions of the present invention, 
unwanted growth of cells may be slowed or halted, thus ameliorating the disease. 
The methods utilized herein specifically target and kill or halt proliferation of tumor 

1 0 cells having receptors for the ligand on their surfaces. 

The identified ligands/anti-ligands may also be used to treat or prevent 
atherosclerosis and stenosis, a process and the resulting condition that occurs 
following angioplasty in which the arteries become reclogged. Generally, treatment 
of atherosclerosis involves widening a stenotic vascular lumen, permitting greater 

IS blood flow and oxygenation to the distal tissue. Unfortunately, these procedures 
i nduce a normal wound healing response in the vasculature that results in restenosis. 
Of the three components to the normal vascular response to injury, thrombosis, elastic 
recoil and smooth muscle cell proliferation, anti-thrombotics/platelet inhibitors and 
vascular stents effectively address acute/subacute thrombosis and elastic recoil, 

20 respectively. However, no existing therapy can modify the vascular remodeling that 
is due to proliferation of smooth muscle cells at the lesion, their deposition f 
extracellular matrix and the subsequent formation of a neointima. Accordingly, 
phage could be used to deliver therapeutic nucleic acids or proteins that would inhibit 
restenosis. 

25 Wound response also occurs after other interventions, such as balloon 

angioplasty of coronary and peripheral vessels, with or without stenting; carotid 
endarterectomies; vein grafts; and synthetic grafts in peripheral arteries and 
arteriovenous shunts. Although the time course of the wound response is not well 
defined, if the response can be suppressed for a short term (approximately 2 weeks), a 

30 long term benefit is achieved. 
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In other various embodiments the applications of the technology 
described herein is virtually limitless. For example, bacteriophage may be retargeted 
(e.g., redirected from their native binding) using ligands added to their coat to treat 
bacterial d i sease in plants and animals. Briefly, ligands may be screened against a 
5 variety of pathogenic bacteria and ligands which effectively deliver expressible 
products to the interior of the cell may be utilized to shuttle toxic components 
selectively into these bacteria. One of ordinary skill in the art would readily 
recognize that the methods described herein may be modified to alter the native 
binding of a bacteriophage such that the bacteriophage binds and injects its contents 

10 or is internalized thereby delivering its contents in select bacteria. Accordingly, once 
ligands are identified the bacteriophage may be used to delivery cytotoxic expression 
plasmids to the interior of the bacteria or alternatively the ligands may be attached to 
toxic chemical moieties that will be delivered to the bacteria in high concentrations 
via the ligand targeting agent. Further, the mere delivery of a replication competent 

15 phage to a bacteria could induce death by cell lysis, thereby creating a bacteriophage 
antibiotic, specifically targeted to select bacteria. Such targeting may also be 
extended to targeting plant cells, yeast, fungi, and virtually any other cell or 
microorganism. For example, food production may be enhanced by transducing yeast 
or cheese producing bacteria with bacteriophage carrying a gene of interest and 

20 targeted to the yeast or bacteria using a ligand identified by the methods described 
herein. 

The present invention provides the capability of identifying ligands 
which internalize as well as proteins, antibodies, cell/cell interacting proteins that 
define the interrelationships between cells, host/pathogen, tumor/stroma, 
25 autocrine/paracrine factors and allows identification of molecules that are targets for 
new drug discovery or are themselves therapeutically or diagnostically useful. 
Further, other peptides can be discovered by the methodologies taught herein that 
enhance endosomal escape, nuclear localization, cell binding, and thus discover 
that are useful not only in phage mediated gene therapy, but generally 
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applicable to standard gene therapy methodologies (e.g., enhancing gene expression 
from animal viral vectors, DN A-conj ugates etc.) 

The following examples are offered by way of illustration, and not by 
way of limitation. 
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EXAMPLES 



EXAMPLE I 

Modified Phage Vectors for Mammalian Cell Transduction 

5 

A m a mm a lian expression cassette is inserted into a phage or phagemid 
vector and is used to detect ligand mediated phage entry via reporter gene expression 
in m a mm al i a n cells. A type 3 filamentous phage vector is modified for transduction 
of mammalian cells by insertion of a GFP expression cassette consisting of a CMV 

1 0 m a mm a li an transcriptional promoter, the green fluorescent protein gene from pEGFP- 
Nl (Clontech; Palo Alto, CA), and a bovine growth hormone transcriptional 
terminator and polyadenylation signal to make the vector, MEGFP3 (see Figure 1 A). 
The mammalian expression cassette also contains an SV40 origin of replication 
adjacent to the CMV promoter. Similar constructs for monitoring entry and 

15 subsequent expression of phage genomes in mammalian cells are constructed from 
other known phage or phagemid vectors including pCANTAB 5 E (Pharmacia 
Biotech; Piscataway, NJ) or M13 type 3 or 33 for gene in fusions (see Kay et al. 9 
Phage Display of Peptides and Proteins: A Laboratory Manual Academic Press, 
1996; McConnell et al. 9 Mol Divers. 7:165-176,1996) and M13 type 8 or 88 vector 
20 for fusions to gene VIII protein (Roberts et al. 9 Methods Enzymoi 257:68-82, 1996; 
Marklande/o/., Gene 70P:13-19, 1991). 



EXAMPLE 2 

Construction of FGF2-Cont a ining Phage Display Vectors 

25 

In the following examples, a phage that displays FGF2 on its surface is 
used to bind to the FGF2 receptor on mammalian cells and be internalized. An FGF2 
gene is subcloned into the modified M13 phage type 3 vector, MEGFP3, to create the 
Ugand display phage, MF2/1G3 (see Figure IB). The gene may also be mutated such 
30 that it encodes an FGF2 (C96S) (C78S) double mutant which enhances expression 
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efficiency. The MEGFP3 vector has been modified with a mammalian expression 
cassette designed to express the reporter gene GFP to monitor mammalian cell 
transduction by the phage. Other vectors include pCANTAB 5 E (Pharmacia 
Biotech; Piscataway, NJ) or M13 type 3 or 33 for gene III fusions (see Kay et al % 
5 Phage Display of Peptides and Proteins: A Laboratory Manual , Academic Press, 
1996; McConnell et a/., Mol Divers. 7:165-176,1996). Similarly, FGF2 is cloned 
into Ml 3 type 8 or 88 vector for fusion to gene VIII protein (Roberts et a/., Methods 
Enzymol. 2d7:68-82, 1996; Markland et a/., Gene 109:13-19, 1991). 

To facilitate cloning, the FGF2 gene is amplified by PCR using 

1 0 oligonucleotide primers that contain ap pro pri ate restriction endonuclease sites in the 
phage vector gene III or VHI genes. The resulting phage express FGF2 on their 
surface coat as detected by anti-FGF2 antibodies in Western blots (Figure 2) and by 
ELISA (Figure 3). 

Western blot detection of FGF2-pIII fusion utilizes extracts from 

IS equivalent phage titers of purified FGF2 phage and control phage (MEGFP3) 
separated by polyacrylamide gel electrophoresis and blotted onto nitrocellulose. 
FGF2 and FGF2-fusion phage are detected with an anti-FGF2 monoclonal antibody 
(Transduction Labs; Lexington, KY) and HRP conjugated anti-mouse secondary 
antibody (American Qualex; San Clemente. CA) with chemiluminescent 

20 development A single protein band is detected in the cesium chloride purified 
FGF2-phage extract migrating at about 80 kDa. This is about the size predicted for 
the FGF2-pID fusion protein (FGF2 (18kDa) fused to pill (migrates ~~60kDa)). CsCl 
purification is performed to remove any non-covalently bound FGF2 fusion protein 
from the phage particles. 

25 Binding of the FGF2 fusion phage to FGF2 receptor is assessed by 

ELISA in which recombinant FGF2 receptor is attached to the solid phase and an 
anti-phage antibody is used as the primary detection antibody. Briefly, phage were 
captured with an anti-FGF2 rabbit polyclonal antiserum bound to the plate well. An 
HRP conjugated anti-M13 antibody (Pharmacia Biotech; Piscataway, NJ) was used to 

30 detect the bound phage. When anti-phage antibody is used to capture the phage and 
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equivalent OD is observed for both control (MEGFP3) and FGF2-phage (MF2/1G3) 
indicating that equivalent phage particles are applied to the plate (Figure 3A). In 
Figure 3B an increased OD indicates the presence of FGF2 on the MF2/1G3 FGF2- 
phage. 

5 

EXAMPLE 3 

Target Cell Line Engineering 



To increase the sensitivity of the assay for transduction by ligand 
10 display phage the target cell line is transfected with a plasmid that is designed to 
express the SV40 large T-antigen (Le. pSV3neo). This plasmid also contains a drug 
selection gene such as neomycin phosphotransferase (neo) which confers resistance to 
the antibiotic G4 1 8 in stabley transfected mammalian cells. Following transfection of 
the target cell line with plasmid DNA using standard methods (i.e. CaP0 4 co- 
15 precipitation) the cells are split and maintained in G418 containing media until drug 
resistant colonies appear. The colonies are expanded to test for SV40 T-antigen 
synthesis by western blotting or immunopreciphation using a suitable antibody. 
Examples of T-antigen expressing target cell lines are: BOS (BHK with SV40 T-Ag) 
for screening FGF variants; HOS-116 (HCT116 with SV40 T-Ag) for screening 
20 peptides that target human colon carcinoma; AOS-43 1 (A43 1 with S V40 T-Ag) for 
screening EGF variants (all parent cell lines are available from ATCC, Manassas, 
VA) 



EXAMPLE 4 

25 Binding and Internalization of FGF2-Expressinc Phage 

The FGF2-expressing phage are also assayed for high affinity receptor 
binding and internalization in receptor bearing cells by immunolocalization and 
fluorescence microscopy (Hart, J. Biol Chem. 269:12468-12474, 1994; Barry et al., 
30 Nature Med. 2:299-305, 1996; Li, Nature Biotech. 73:559-563, 1997). 
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Infection of mammalian cells with FGF2-expressing phage is 
performed under conditions that block entry of wild type Ml 3 phage into cells except 
chloriquine is not used (Barry et aL, supra). Phage are added directly to cells at titers 
of £10 ,c CFU/ml in PBS with 0.1% BSA or other suitable blocking agents and 
5 incubated at 37°C or on ice for at least 1 hour. The cells are then washed extensively 
in PBS, fixed in 2% paraformaldehyde, and permeabilized in 100% methanol at room 
temperature for 10 minutes. Cells are incubated with rabbit anti-M13 antibody 
(Sigma; St Louis, MO) in PBS/BSA for 1 hour. The primary antibody is detected 
with a phycoerythrin labeled anti-rabbit antibody (Life Technologies (Gibco BRL); 
10 Rockville, MD). Surface bound (incubated on ice) or internalized (37°C incubation) 
phage are detected by fluorescence microscopy. 

EXAMPLE 5 

Transduction of Mammalian Cells by FGF2-Ligand Display Phage 

15 

FGF2 display phage (MF2/1G3) and an identical phage thai lacks the 
FGF2 gene (MEGFP3) are compared for receptor mediated internalization and 
reporter gene expression in COS cells. The phage are incubated with the cells for 4 
hours at 37°C in DME (Dulbecco's modified Eagles medium, Life Technologies 

20 (Gibco BRL); Rockville, MD) containing 2% BSA (bovine serum albumin) as a 
blocking agent After washing to remove unbound phage the cells are returned to the 
incubator for an additional 72 hours. Transduction is measured by counting GFP 
positive autofluorescent cells. As shown in Figure 4B, the FGF2 display phage result 
in about a 10 fold greater transduction efficiency than the control phage indicating 

25 that the displayed FGF2 ligand on the surface of the phage particles results in receptor 
mediated binding and internalization of phage with subsequent expression of the 
phage reporter gene. The specificity of the FGF2-phage mediated transduction is 
demonstrated by successful inhibition of transduction with excess free FGF2 
(2*ig/ml) (Figure 4B). The low level nonspecific uptake and transduction by the 

30 control phage (MEGFP3) is not affected by the presence of excess FGF2. 
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It is important to show that the MEGFP3 control phage is equally 
capable of transducing m a mm a l ian cells as the display phage when appropriately 
targeted. To compare the transduction ability of both the FGF2-phage and the control 
phage, equivalent titers of each phage were used to transfect COS cells using a 
5 avidin-biotin FGF2 targeting method. In this method biotinyiated FGF2 is contacted 
with the cells and used to capture phage particles via the addition of avidin and a 
biotinyiated ami -phage antibody. The phage/FGF2/cell binding is performed on ice, 
unbound phage removed by washing, cells returned to the incubator at 37°C, and 
transduction assessed at 72 hours. As seen in Figure 4A, there is no significant 

10 difference in transduction between FGF2-phage and control phage when FGF2 is 
attached to the phage via an avidin biotin linkage. In this case the biotinyiated FGF2 
is in excess of the FGF2 displayed on the phage surface such that internalization is 
expected to be primarily via the biotinyiated FGF2. These data demonstrate specific 
receptor mediated transduction of mammalian cells by filamentous phage that 

1 5 genetically display a targeting ligand (FGF2). 

EXAMPLE 6 

Construction of a Reporter Gene and a Drug Resistance Gene in 

Phage Display Vectors 

20 

A GFP expression cassette consisting of the GFP gene (Cormack et 
fl/., Gene 775:33-37, 1996) under control of a CMV promoter, a neomycin 
phosphotransferase gene under control of the SV40 early gene promoter, and an 
SV40 origin of replication are cloned into a gene III phagemid vector such as 

25 pCANTAB 5E using standard methods (Sambrook et a/., Molecular Cloning : A 
Laboratory Manual Cold Spring Harbor Press, 1989). The resulting phage is 
designated pmaM13. The same phagemid genome also containing FGF2-3 fused to 
gene III is designated pFGF-maM13. Similar constructs are also made with M13 
phage type3 and type 33 and gene VIII phagemid and phage vectors. Recombinant 

30 phage displaying FGF2 on the coat and carrying the mammalian expression cassettes 
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including the SV40 replication origin are prepared by phagemid rescue with M13K07 
(or suitable helper phage) are added to COS cells as described above. GFP 
expression is detected by fluorescence microscopy, fluorometry, and flow cytometry 
at 48-96 hours after phage addition. Drug resistant cells are selected with G41 8. 

5 

EXAMPLE 7 

Selection of FGF2-Expressing Phage from a Mixed Population 

A M13 phage display library of random or unknown sequences is 
spiked with pFGF-maM13 phage. The mixture is used to infect COS cells as 
described above. The cells are washed extensively to remove non-specifically bound 
phage. Cells are re-plated 48-96 hours later at a 1 to 10 dilution and grown in G418 
to select only cells that receive the transducing phage gene. Alternatively, the GFP 
expressing cells are isolated by flow cytometry using an excitation wavelength of 488 
and emission wavelength of 510. 

DNA is extracted from G418-resistant cells and the FGF2 sequence is 
amplified. The amplification primers have sequences complementary to phage 
sequences located on each side of the FGF2 sequence in the gene III coding sequence. 
Detection of the FGF2 sequences in selected COS cells that are infected with a 
mixture of phage where the pFGF-maM13 phage is diluted at least 1 : 10,000 with the 
random sequence phage library demonstrates feasibility of the technique. 

EXAMPLE 8 

Identification of FGF2 Variants for Improved Gene Delivery 
25 

A library of shuffled FGF2 mutants is created using the gene shuffling 
method described by Stemmer {supra). The FGF2 gene is amplified by PCR and 
fragmented by DNAse 1 treatment The fragments are reassembled using PCR in the 
absence of primers. The reassembled gene is cut with the app ro pri ate restriction 
30 enzymes and cloned into an M13 phage vector such that the FGF mutants are fused 
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in-frame with the pill coat protein gene. The phage vector contains a CMV promoter 
driven GFP reporter gene and an SV40 origin of replication. Several individual 
phage clones are sequenced to confirm that an average of 3 mutations per phage have 
been generated during the reassembly process. The resulting phage library of FGF2 
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Ag) is incubated with the library such that each member of the library is at an m.o.i. 
of at least 10. Accordingly, 10 11 phage representing 10 6 copies of 10 5 individual 
phage species are applied to 10 3 cells. The phage are incubated with the cells in PBS 
supplemented with 2% fetal bovine serum for 1-3 hours, after which non-binding 
1 0 phage are removed by extensive washing with PBS. Media is added and the cells 
returned to the incubator at 37° C to allow phage internalization. 

EXAMPLE 9 

Screening Libraries for Gene Delivery Ligands 

15 

If the source of the desired ligand is not known, random peptide 
libraries or a cDNA library from placenta is used as a starting point for cDN A library 
screening. The library is amplified in the maM 13-33 phage by infecting DH5aF (or 
other suitable host) bacteria, growing the culture overnight at 37°C and isolating the 

20 phage from the culture medium using standard protocols. A cDNA library containing 
10 5 members has each member represented 10 6 times in a typical phage titer of 10" 
colony forming units/ml. The amount of phage used to infect is adjusted to the 
complexity of the library. 

The completed maM 13 phage library is s cr ee n ed against the target 

25 tissue or cell line. Screening can be performed in-virro or in-vivo. The criteria for a 
positive "hit" is that the phage must be able to bind, be internalized, translocate to the 
nucleus, uncoat and replicate and express the genomic DNA containing the reporter 
gene in the target cell. Thus, only transduced target cells are selected either by GFP 
expression and cell sorting or drug resistance. Screening is performed directly 

30 against the target cells with no prescrcening r enrichment. Enrichment for cell 
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binding is performed if no hits are found in the initial screen. A prescreen to select 
out phage that bind non-specific cells surface proteins is performed to reduce non- 
specific hits or if there are too many initial hits. Infection of at least 10 7 target cells is 
performed with at least 10" phage. The cells are incubated for at least 2 hours in PBS 
5 and washed extensively as described by Barry (Barry et aL 9 Nature Med., 2:299-305, 
1996). The cells are incubated in media at 37°C for 48-96 hours and selected in the 
appropriate drug (e.g., G418) for 7-14 days or until resistant colonies are visible 
microscopically. Drug resistant colonies are pooled, and the selected cDNAs 
amplified and subcloned back into the maM 13-33 phage vector using PCR and 

10 standard molecular biology methods. Alternatively individual colonies are screened. 
Representative phage clones are sequenced to identify potential gene delivery ligands. 
Repeated rounds of infection and selection are performed to reduce the complexity of 
the selected clones. More stringent screening conditions such as increased selective 
drug concentrations or FACS sorting or the strongest fluorescent cells are performed 

15 in the later s c re en s to select the most highly efficient gene delivery ligands from the 
initial screening. 

Screening in-vivo is performed using methods previously described by 
Pasqualini for targeting organs or xenograft tumors using phage displayed peptides 
(Pasqualini, R. et al 9 Nature Biotechnology. 15, 542-546 (1997); Pasqualini, R. et al, 

20 Nature, 380, 364-366 (1996)) except that the organs or tumors are examined for 
reporter gene expression instead of the presence of phage. The phage library is 
injected intravenously into mice and organs or tumor samples tested for reporter gene 
function at 48-96 hours after injection. Tumor cells are cultured in G418 or FACs 
sorted (for GFP expression) to enrich for cells that express the phage transducing 

25 gene. The ligand encoding sequences are amplified from selected cells using PCR as 
described for in-vitro screening. As in in-vitro screening, repeated rounds of 
infection and rescreening are performed at increasing screening stringency to obtain 
the most efficient gene delivery ligands. 
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EXAMPLE 10 

Identification of Ligands that Target Colon Carcinoma 

In this example, a library of oligonucleotides encoding random 
5 peptides is inserted into a filamentous phage genome such that the peptides are fused 
to the C-tenninus of intact pill coat proteins. A type 3 phage vector that only 
contains one copy of the pill gene is used and, therefore, all of the pill protein that is 
made will be fused to a peptide. Thus, 3-5 copies of a peptide is displayed on each 
phage. To simplify the screening the complexity of the library is first reduced by 

10 screening it for internalizing peptides. Peptides that facilitate the internalization of 
phage into a colon carcinoma eel) line are isolated through several rounds of 
selection. The phage library is incubated with the cells for 3 hours at room 
temperature. The cells are washed extensively in PBS. A brief proteinase K 
treatment is used to inactivate phage that adhere to the cell surface. The cells are then 

1 5 lysed and cell lysates incubated with host bacteria. Internalized phage are amplified 
in bacteria and subjected to 4 or more iterations of exposure to cells and recovery of 
internalized phage. Replicative form DNA is prepared from the resulting sublibrary 
of internalizing phage. The random sequences in the sublibrary are subcloned into a 
phage vector MEGFP2 that contains a copy of the CMV driven reporter gene (GFP) 

20 and an SV40 replication origin. MEGFP2 differs from MEGFP3 (Figure 1 A) in that 
the ori-CMV/EGFP expression cassette is in the reverse order, EGFP is followed by 
an SV40 polyadenylation site instead of Bovine Growth Hormone poly A, and the 
vector contains three additional Nco I sites within the ori-CMV/EGFP expression 
cassette. 

25 The resulting CMV-GFP modified sublibrary is incubated with the 

HOS-116 recipient cell line such that each member of the library is represented at 
least 10 6 times. Thus, for example, a library with 10 5 members is added to -10 5 cells 
at a titres of -lxlO 11 yielding an m.o.i. for each member of at least 10. The phage arc 
incubated with the cells in PBS supplemented with 2% fetal bovine serum for 1-3 

30 hours, after which non-binding phage are removed by extensive washing with PBS. 
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Media is added and the cells returned to the incubator at 37° C to allow phage 

EXAMPLE 1 1 

5 Recovery of Ligand Encoding Sequences from Replicati ve Phage 

At 72 hours following the addition of the phage library. The target 
cells are removed from the plate and sorted for GFP expressing cells by FACS. The 
positively sorted cells are lysed and treated with proteinase K. The proteins are 
10 extracted with phenol/chloroform (24:1 solution) and nucleic acids precipitated in 
ethanol. The resulting DNA is resuspended in SI nuclease buffer and treated with SI 
nuclease to remove non-replicative single strand phage DNA. The DNA is again 
extracted with phenol/chloroform, precipitated, and resuspended in polymerase chain 
reaction buffer. Alternatively, nuclei are prepared from the positive cells, proteinase 

15 K treated and the lysate used directly in the PCR reaction. In either case, an 
equivalent number of negatively sorted cells are treated in parallel and used in the 
PCR reaction to monitor the enrichment of implicative phage DNA (double-stranded) 
over non-replicative phage DNA (single stranded) such that there is no phage DNA 
amplified in the samples from GFP negative cells. If phage DNA is amplified from 

20 negatively sorted cells then conditions must be made more stringent for the removal 
of single stranded phage DNA such as increasing treatment with SI nuclease or 
further purification of nuclei through repeated sucrose step gradient purification or 
other suitable methods known for purification of nuclei (to remove non-replicative 
phage). These conditions might need to be determined empirically for each cell line 

25 and library used. 

The phage sequence(s) encoding the ligand peptide is amplified using 
an appropriate set of oligonucleotide primers that flank the ligand encoding DNA 
sequence inserts that is fused to the pill gene. These amplified inserts are recloned 
into the parent phage vector to create a sub-library of phage enriched now for gene 
30 delivery ligands for the target colon carcinoma cell line. Sequencing is performed on 
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representative clones to determine the complexity. The screening process is reiterated 
until the complexity is reduced sufficiently to identify one or more targeting ligands. 

EXAMPLE 12 

Second Generation Screening of Peptides 

Peptides are selected which have been previously identified from a 
random library by one or more panning or screening procedures using conventional 
vectors and panning methods (see Kay et aU Phage Display of Peptides and 
Proteins: A Laboratory Manual Academic Press, 1996). The DNA encoding the 
selected peptides is inserted as a fusion to the pill coat protein in the MEGFP2 vector 
containing the GFP reporter gene cassette. 

An Ml 3 phage random peptide library is screened for peptides that 
bind and internalize in an FGF receptor overproducing cell line, Flg37 (an FGFRI 
stable transfectant of L6 cells (available from the ATCC; Manassas, VA) obtained 
from Dr. Murray Korc, UCI; Irvine, CA). In addition, such a cell line may be easily 
created by those skilled in the art. Following 5 rounds of panning and rescreening the 
complexity of the library is reduced such that 80% of the phage are represented by a 
single peptide-pIII fusion. The resulting peptide, FL5, has the sequence 
FVPDPYRKSR (SEQ ID NO: 1). The same library is also screened against Flg37 
cells by selecting infective phage particles thai internalize and associate with nuclei 
and cytoskeletal proteins. The 2 predominant peptide sequences identified by this 
screen after 5 rounds of panning are FN5A, CGGGPVAQRC (43%) (SEQ ID NO: 2) 
and FN5B, CLAHPHGQRC (34%) (SEQ ID NO: 3). 

Oligonucleotides encoding the 3 peptides are inserted into the MEGFP 
vector as fusions to the pill coat protein. The resulting phage are used to transfect 
COS cells. Phage are added to cells and incubated overnight at 37 °C in medium 
with 10% fetal calf serum. The cells are washed to remove unbound phage and 
returned to the incubator. Transduction is assessed by counting GFP expressing 
autofluorescent cells at 72 hours after the addition of phage. The results (Figure 5 are 
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that a greater transduction efficiency is observed with FL5 than FN5A or FN5B 
indicating that FL5 is a more efficient as a gene transfer ligand in this system. The 
transduction screening method as a second generation screen is capable of 
distinguishing among peptides that were selected by different primary cell based 
5 screens. 

EXAMPLE 13 

EGF Mediated Mammalian Cell Transduction 

10 Epidermal growth factor displaying phage were constructed as 

described above for FGF displaying phage. Western blot analysis demonstrates that 
EGF was efficiently expressed on the phage coat in a multivalent manner (Figure 6). 
Phage were prepared for Western analysis by obtaining the EGF-phage from cultures 
of infected host bacteria and purified by PEG precipitation and CsCl gradient 

15 centrifugation. The phage particle proteins were then separated by gel electrophoresis 
and blotted onto a nitrocellulose membrane. Blots were then probed with either anti* 
EGF or anti-pm antibody (mouse anti-human EGF, Biosource International; 
Camarillo, California) or anti-pill antibody (mouse anti-pill, MoBiTech; Germany) 
followed by HRP-goat-anti -mouse (Jackson Laboratories, USA). 

20 Following the procedures detailed above, EGF-phage were s cr eened 

for their ability to effectively transduce COS cells. Briefly, EGF-phage were 
incubated with COS cells (-75,000 cells/well) for 72 hours with a variety of phage 
liters. As demonstrated by Figure 7 the optimal dose was 10'° pfu/ml which resulted 
in the highest transduction efficiency with almost no non-specific transduction by 

25 untargeted phage. Transduction efficiency also increases with longer incubation 
times. As demonstrated in Figure 8 when EGF-phage were incubated with COS cells 
(-75,000 cells/well) at 10 ,! pfu/ml for various times and subsequently measured for 
GFP expression at 72 hours, longer incubation times increased transduction 
efficiency. 
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Further, specificity of EGF-phage mediated COS cell transduction was 
determined by incubating EGF-phage with excess ligand. As depicted in Figures 9A 
and 9B, COS cells incubated with 10 n pfu/ml of phage for 72 hours with or without 
excess ligand or untargeted phage demonstrate that targeting is due to the presence of 
5 the ligand. 

EXAMPLE 14 

Simultaneous Identification of Internalizing Ligands and anti-Ligand 

Binding Targets 

10 

To identify internalizing ligand-anti -ligand binding target interactions, 
the putative ligand is displayed on the surface of filamentous phage that carry a 
mammalian reporter gene expression cassette. The candidate binding target 
peptides/proteins are expressed on the surface of COS cells by substituting the target 

1 5 cDN A for the extra cellular domain encoding DN A portion of the EGF receptor in a 
suitable m a m mal i a n cell expression vector (i. e. , pcDNA 3.1; Invitrogen, C A). 

To accomplish this, a library of cDNAs is inserted into a mammalian 
expression vector (pcDNA 3.1) such that the cDNAs are fused to the transmembranes 
and intracellular domains of EFG receptor cDNA. DNA is prepared from individual 

20 or pools of bacterial clones that have been transformed to cany the cDNA-receptor 
fusion protein expression plasmid. COS cells are transfected with the resulting 
plasmid DNAs in six well plates at low density. At 24 hours later, ligand display 
phage carrying the CMV driven reporter gene GFP are added to the transfected COS 
cells. 

25 Binding of the phage displayed ligand to the cell surface display 

binding target (i.e. protein - EGF receptor fusion protein), results in dimerization of 
the receptor and subsequent internalization of phage that display the binding ligand. 
The internalized phage are trafficked to the nucleus where the reporter gene is 
expressed. 72 hours after adding phage, cells expressing the reporter gene are 

30 selected by FACs. cDNAs encoding reactive peptides are identified by the presence 
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of GFP positive cells in the COS transfectants for each cDNA or cDNA pool. The 
binding ligand is identified by PCR amplification and sequencing of the phage 
ligand-pHI fusion gene. The target peptide is identified by PCR amplification and 
sequencing the peptide-EGF receptor fusion protein from the selected cell(s). 

5 

EXAMPLE 15 

Identification of Cell Targets 

Phage that display a ligan d as a pill fusion on the phage coat and carry 
10 the GFP expression cassette are prepared using standard protocols, as discussed 
above. Control phage that cany GFP but don't display a ligand are also prepared. 
Candidate cell targets are seeded into 6 well culture plates at about 40,000 cells/well. 
At 24 hours after seeding cells, phage are added at -10 10 pfu/ml. The plates are 
incubated at 37°C for an additional 72 hours. Each cell well is scored by counting 

15 GFP positive autofluorescent cells. The cell types that have a ratio of GFP positive 
cells in the ligand-phage treated well/control phage treated cells of greater than 1 .0 
are selected as targets for further study and characterization. As an alternative to 
GFP, a drug resistance gene can be used in which case after 72 hours the cells are 
allowed to continue growth in selective medium containing the drug. Positive cell 

20 types are scored by counting wells that have drug resistant colonies. 

Carcinoma cell lines which are known to express EGF were s creen ed 
by the above method using EGF-phage and compared to the control endothelial cell 
line which is EGF receptor negative (Cell lines obtained from ATCC, Manassas, V A 
and grown under standard . ATCC culture conditions). As shown in Figure 1 0, the 

25 carcinoma cell lines derived from various tissues were differentially transduced while 
the receptor negative, endothelial cells displayed no transduction. Accordingly, 
identification of target cells or tissues can be accomplished using these methods. 
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EXAMPLE 16 

Identification of Pathogen Target Cells 

Ligand display phage are constructed as discussed above, with the 
5 ligand being full-length or fragments of coat or envelope proteins of a known r 
suspected pathogen. The ligand expressed on the display phage coat can be expressed 
from the cDNA or cDNA derivative of the coat or envelope protein of a known or 
suspected pathogen (e.g.. HTV envelope protein gene). The envelope gene is 
randomly fragmented to form a library of display phage display distinct portions of 

1 0 the coat protein. Thereby allowing determination of the portion of the gene that 
encodes a protein that functionally interacts with the host cell surface receptors 
allowing internalization. Smaller pathogen coat proteins are displayed in entirety. 
The pathogen coal display phage acts as surrogate pathogen with the advantage of 
providing a simple assay for detection of host cells. Phage displaying coat protein are 

15 screened against various cell types in vitro as described above or in vivo by injection 
and subsequent identification of target cells and tissues by fluorescent microscopy, 
FACS analysis to detect GFP, or growth in selective medium to detect expression of a 
drug resistance marker. 

20 EXAMPLE 17 

Identification of Pathogen Ligands 

The gene(s) or portion of a gene that interacts with the host cell 
surface receptor to allow internalization is identified by making a phage display 
25 library of the cDNAs expressed by the pathogen or of the pathogen genome or 
fragments of the genome. The display library phage vector carries the GFP or 
suitable reporter gene driven by the mammalian promoter, as described above. The 
libraries are then screened against a known or putative host cell types by detecting 
transgene expression (i.e., drug selection or other detectable marker). Once cells are 
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identified, the sequence of the nucleic acid encoding the internalizing ligand is 
determined by PCR sequencing of the pill-putative ligand fusion construct. 

EXAMPLE 18 

5 Identification of Secreted and Internalizing Licands for Tumor Cells 

Tumor cells interact with surrounding host stromal and other cell types 
via chemo-attractants and other factors which, for example, stimulate the stromal 
cells to secrete factors that support tumor growth (i.e., VEGF). To investigate these 

1 0 interactions, a library of putative secreted ligand cDNAs is prepared from tumor cell 
mRNA and selected by methods known in the art such as epitope-tagging, Sloan et 
al. 9 Protein Expression and Purification 11:1 19-124, 1997. The secreted protein 
encoding cDNAs are inserted into the reporter phage vector as described above. 

Individual phage clones or pools of phage clones are s cr ee n ed against 

15 various stromal cell types to identify cell types that are targets for tumor cell secreted 
factors, and to identify the secreted factors. The inverse strategy can also be applied 
by screening a library of, for example, fibroblast or other stromal cell secreted protein 
encoding cDNAs for factors that bind and internalize into various tumor cell types. 

* 20 EXAMPLE 19 

Selection of EGF-Expressing Phage from a Population of Non-Targeted 

Phage 

Non-targeted Ml 3 phage was spiked with EGF-phagc. The mixture 
25 was used to infect COS cells and incubated for 72 hours, as described above. The 
cells are washed extensively to remove non-specifically bound phage. The GFP 
expressing cells are isolated by flow cytometry (FACS) using an excitation 
wavelength of 488 and emission wavelength of 510. 

DNA is extracted from GFP positive cells and the EGF sequence was 
30 amplified by PCR. The amplification primers have sequences complementary to 
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phage sequences located on each side of the EGF sequence in the gene III coding 
sequence. These sequences were re-cloned into the phage vector and new phage were 
prepared for subsequent rounds of selection. 

Briefly, the following biotinylated oligonucleotides were used to 
5 amplify the iigand gene III fusion: 

AnchorlM8f 48 5310 AAAGGATCCGGGTTCCCGCGTGGGCGATGGTTGTTGTCATTGTCGGC 
M3rev2 25 bio CCGTAACACTGAGTTTCGTCACCAG 

1 0 However, the following have also been used with success: 

M8for2b 30 biotin GCGTGGGCGATGGTTGTTGTCATTGTCGGC 
m3revB 25 biotin CCACAGACAACCCTCATAGTTAGCG 

Proteinase K treated nuclear preparations of FACsoned COS cells. 
1 5 PCR is carried out using Clontech's Advantage GF polymerase mix and cycled under 
the following conditions: 

1 cycle: 
94°C lmin. 

20 40 cycles: 

94°C 20 sec. 
60°C 20 sec. 
72°C 20 sec. 

25 Following amplification, duplicate samples were combined and 

purified using Qiaquick columns (Qiagen Inc., Valencia, CA). The PCR products 
were then digested with Ncol and Pstl restriction endonucleases. While SA-magnetic 
beads or other means for removal of n ends M of fragments increases ligation efficiency, 
such removal is not required. The digestion product is iigated into a new phage 

30 vector and used to transform competent cells by electroporation. 

After sub-cloning and electroporation of bacterial cells, the bacteria 
were plated in top agar and grown overnight at 37°C, plaques were selected from 
plates and analyzed via PCR using oligonucleotides as f Hows: 

MANPglllf 20 TTTTGGAGA 1 II I C AACCTG 
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MANPgHIr 20 TGCTAAACAACTTTCAACAG 

However, the oligonucleotides listed previously above, are equally 

useful. 

As demonstrated in Figure 11, enrichment of targeted EGF-phage 
from 0.1% to 100% EGF-phage was complete after 3 rounds of selection and 
enrichment from 0.0001% to 100% EGF-phage was complete after 4 rounds of 
selection. Accordingly, this experiment demonstrates the ability to select a specific 
ligand expressing phage from a population at dilutions of 1:10' and 1:10 6 . In 
addition, while further diluted ligand expressing phage can be detected, further 
rounds of selection may be necessary. 



EXAMPLE 20 

Creation of a Sub-Library of Peptides That Are Internalized and 
1 5 Trafficked to the Nucleus 



Phage that display a candidate ligand as a pill or pVIII fusion on the 
phage coat are prepared using standard protocols, as discussed above. In the present 
experiment a phage library (MANP-TN10, Dyax, Corp.) is used. COS cells are 
plated on 2X10 cm plates at about 105,000 cells/plate. At 24 hours after seeding 
cells, phage are added at ~10 10 pfu/ml. The plates are incubated at 37°C for an 
additional 72 hours. The cells are then harvested in Trypsin-EDTA and pelleted. The 
cells are re-suspended in 0.5 ml of PBS and under-layed with 0.5mls of nuclear 
isolation buffer (NIB) and spun at 200 x g for 8 minutes at 4°C (NIB=40% Sucrose, 
25 0.1% DMSO, 2% NP40, 1.6% Triton X-100, 02mM AEBSF (4-(2- 
Aminoethyl)benzenesulfonyl Fluoride, in PBS). The pellet is then re-suspended in 
400 pd 1%NP40 and under-layed with 40% sucrose and again spun at 200 x g for 8 
minutes at 4°C 

The pellet is then re-suspended in 1 00 ji] PKB (Proteinase K Buffer- 
30 50 mM Tris-HCl pH 8.5, 1 mM EDTA, 0.5% Tween-20). 1 .4 jil of PK (Proteinase K 
from Boehringer Mannheim, 14mg/ml) is added and incubated at 55°C for 3 hours, 
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then healed to 95°C for 15 min. The DNA is then pelleted at maximum speed in a 

microfuge and washed 1 time with 70% ethanol followed by air drying in a hood. 

The resulting pellet is resuspended in a 20|al of lOmM Tris pH 12 and the ligand 

gene III fusion is amplified as described in Example 19. above, except that the 

5 cycling pi o gram was adjusted as follows: 

1 cycle: 
94°C 1 min. 

25 cycles: 
10 94°C 20 sec. 

60°C 20 sec. 
72°C 20 sec. 

Following amplification, the product was purified using a Qiaquick 
15 column (Qiagen) followed by digestion using Ncol and Pstl restriction enzymes, as 
described above. Biotinylated fragments are removed using streptavidin conjugated 
beads (Promega Corp., Madison, WI), and the resulting sample is ethanol 
precipitated. The precipitated DNA is then washed and re-suspended in 10 jal of 
lOmM Tris pH 8.5 and ligated into the reporter gene carrying phage vector. 
20 Following ligation the DNA is ethanol precipitated, washed, and used to transform 
competent cells (Stratagene electrocompetent cells XI 1 Blue MRF'). Phage selected 
for in this manner (TNlOnuclear selection) are then compared to non-selected phage 
(TNlOlihrary). The comparison of the pre-selected pool demonstrates that a 
significant population of the library which did not internalize has been removed in 
25 one round of screening, see Table below: 



Phage 

^^^^^^^ 

rrep 


In/Out 


Dilution 
factor 


pi plated 


plaques 


Titer 
(pfu/mi) 


Vol of 
prep 


Phage 
recovered 


Fraction 
of input 


TN10 
library 


In 


8 


100 


123 


1.23E+08 


5000 


6.15E+11 




TN10 
nuclear 


Out 


0 


10 


127 


1.27E-KH 


400 


5.08E+03 


8.26E-09 
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EXAMPLE 21 

Ligand Selection via Nucleic Acid Binding Domains 

A phage library comprising a number of ligand displaying phage are 
5 created using a lac operon containing phagemid. The phagemid can be either a 
reporter gene containing phage (e.g., MEGFP3) or a non-reporter gene containing 
phagemid (e.g., any vector containing a phage origin of replication, such as 
pCOMB3, pBS+, pCR* and the like). The MEGFP3 vector has been modified with a 
mammalian expression cassette designed to express the reporter gene GFP to monitor 

10 mammalian cell transduction by the phage. Other vectors include pCANTAB 5 E 
(Pharmacia Biotech; Piscataway, NJ) or Ml 3 type 3 or 33 for gene III fusions (see 
Kay et aL Phage Display of Peptides and Proteins: A Laboratory Manual,, Academic 
Press, 1996; McConnell et aL Mol Divers. 7:165-176,1996). Similarly, the ligand 
library is cloned into Ml 3 type 8 or 88 vector for fusion to the gene VIII protein 

15 (Roberts et aL Methods EnzymoL 267:68-82, 1996; Markland et aL Gene 1 09: 13- 19, 
1991). 

Candidate cell targets are seeded into 6-well culture plates at about 
40,000 cells/well. At 24 hours after seeding cells, phage are added at ~10 10 pfu/ml. 
The plates are incubated at 37°C for an additional 72 hours. Following incubation 

20 with the phage library, the target cells are removed from the plate and sorted for GFP 
expressing cells by FACS or directly lysed and the nucleic acid purified by passing 
over a sepharose 4B DNA affinity-column having conjugated thereto the lac repressor 
protein. Prior to affinity purification the cells are lysed and a nuclear extract is 
produced by following standard procedures, such as those described by Cull et aL 

25 Proc. Natl Acad Sci. USA *P:1865-1869, 1992; Schatz et aL Methods in 
Enzymology 267:171-191, 1996). The nuclear extract is then passed over the affinity 
column. 

After applying to the column, the column is washed extensively with 
loading buffer (20mM Tris-HCl pH 72) and eluted with a salt gradient. The resulting 
30 DNA containing fractions are pooled, amplified by PCR using the flanking gene III 
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or gene VIII fusion sequences as primer templates, and subcloned back into a 
phagemid vector for further rounds of enrichment or alternatively for direct sequence 
characterization. 



EXAMPLE 23 

Internalized Ligand Sequence Amplification by S V40 Shuttle Vector 

Transduction 



The phagemid-shuttle vector includes the SV40 origin and packaging 
sequences, as well as SV40 capsid-encoding late genes under control of a promoter 
functional in target cells. This vector is created by PCR amplification of relevant 
sequ e nces or direct restriction enzyme digestion and sub-cloning. These sequences 
can be obtained from commercially available vectors or wild-type virus. Similarly, 
the phage coat protein fusion, the phage origin and packaging sequences, and 
IS bacterial selection markers can be assembled from current phage vectors. 

Phage particles expressing a library of ligands as genetic fusions n 
the coat proteins (gin or gVIII) are generated by rescuing phagemid containing 
bacteria with a helper phage, such that the phagemid genome is packaged into the 
phage particle expressing the ligand which is encoded by that genome. A target cell 
20 line containing the SV40 T antigens (either transfected or provided in trans with the 
VP22 fusion protein as a delivery vehicle) is incubated with the phage particles. 
Those particles expressing the appropriate ligands deliver the phagemid DNA to the 
nucleus. Due to the presence of the large T antigen in the cells and the SV40 origin, 
the DNA replicates. The dsDNA is then packaged into SV40 viral particles due to the 
25 presence of the capsid proteins encoded by the SV40 genes also carried on the 
phagemid genome. The SV40 particles carrying the phagemid then infect other 
neighboring cells and thus amplifying the internalized ligands until the whole 
population of cells is infected. Eventually, in all permissive cells are lysed due to 
viral production. Viral particles are harvested from the supernatants and the DNA 
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they contain is analyzed by sequencing to determine the sequence of the ligand 
responsible for the initial internalization. 

EXAMPLE 24 

5 Enhancement of Target Cell Transduction Using Heat Shock and/or 

Genotoxic Treatment 

The effects of heat shock and another genotoxic treatment, 
camptotbecin were tested. Camptothecin is a type I topoisomexase inhibiter that 

10 produces DNA strand breaks. Camptothecin is found to enhance transduction 
efficiency up to 2 fold at 0.1 jxM but was inhibitory at higher concentrations (data not 
shown). EGF displaying phage (MG4-EGF vector) were incubated with COS cells 
for 24 hours at which time the cells were either not further treated, subjected to a four 
hour heat shock (42.5°C), or treated with 0.1 yM camptothecin for four hours. GFP 

1 5 autofluorescent cells were counted at 72 hours after addition of phage. As depicted in 
Figure 14 the results indicate about a 2 fold enhancement of targeted phage mediated 
transduction with either heat shock or camptothecin. The combination of both 
treatments resulted in a modest synergistic increase in efficiency over either treatment 
alone. 

20 

EXAMPLE 25 

Enhancement of Transduction Utilizing 
Endosomal Escape Peptide Display 

25 The addition of endosomal escape peptides that are derived from 

minimal peptide sequences needed for viral endosome escape have been shown to 
enhance non-viral delivery of condensed DNA (Sosnowski et aL J. Biol Chem. 
277:33647-33653, 1996). Accordingly, such endosomal sequences were tested with 
the ligand display system described herein. Endosomal escape peptide fusions to pm 

30 that are in-frame with both the EGF gene and the pill gene were constructed. The 
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sequence MAEGLFEAIEGFIENGWEGMIDGWYG (adapted from the INF7 N- 
terminal sequence of the influenza virus X-3 1 hemagglutinin subunit HA-2) was 
added to the N-tenninus of EGF in MG4-EGF or pill in MG4. This sequence is 
identical to the endosome disruptive peptide (INF7) described by Plank et al (J. Biol, 
5 Chem. 269: 129 18- 12924, 1994) except for the addition of the MAE tripeptide at the 
N-tenninus. It is encoded on 2 overlapping oligonucleotides that are annealed and 
ligated into the Ncol site in the MG4 and MG4-EGF vectors. 

* 

The INF7 containing phage were tested on COS cells to determine 
transduction efficiency. The results (Figure 1 5) show that transduction efficiency is 

10 increased about 1.8 fold with the addition of the INF7 sequence relative to the 
unmodified MG4-EGF phage. The addition of the EMF7 sequence in the control 
phage has no effect on the background levels of transduction observed when 
transfecting COS1 cells with this phage at a titer of 1 0" pfu/ml. Thus, an endosomal 
disruptive peptide enhances the ability of targeted phage to transduce COS1 cells 

1 5 when presented on the N-tenninus of the EGF-pffl coat fusion protein. 

EXAMPLE 26 

Enhancement of Transduction Utilizing 
Endosomal Escape Peptide Display and Heat Shock 

The effect of both heat shock and the addition of an endosomal escape 
peptide were tested in combination to determine whether they effect distinct aspects 
of the transduction pathway and to test the limits of phage mediated transduction. 
EGF-phage (MG4-EGF), EGF-phage co-displaying an endosomal escape peptide 
(INF7 as above) (MG4-EGF-EE) or control phage (MG4) were added to COS1 cells 
at a titer of about 10" pfu/ml and incubated for 72 or 96 hours. Cells were subjected 
to a seven hour heat shock (42.5°C) at 40 hours after phage addition. The percentage 
of GFP positive cells was determined by FACS analysis. The results (Figure 16) 
show that the effects of heat shock (HS) and the addition of the endosomal escape 
(EE) sequence are additive. At 72 hours after phage addition, the combination f HS 



BNSDOCID <WO_. 0029555*1 I > 



20 



25 



30 



WO 00/29555 



72 



PCT/US99/25361 



and the EE sequence results in transduction efficiency of 3.5% and at 96 hours in 
about 1 1 % transduction efficiency. At both time points the addition of the EE peptide 
results in about 2X the efficiency compared to heat shock alone. The highest 
transduction efficiency we observe in the absence of HS or the EE peptide is about 
5 2% at 96 hours. 

EXAMPLE 27 

Construction of Dual Display Vector 

The pIII/pVIII dual display vector is constructed by the insertion of a 
pVIII encoding open reading frame down stream from the stop cod on of pin in a pffl- 
fusion phagemid vector (e.g., pUC-MG4 (Figure 17)). This is accomplished by 
amplifying the mature gVIII gene from MG4 phage using primers that contain EcoRl 
restriction endonuclease extensions on their ends. 

The DNA sequence encoding the Eco Rl insert for making dual 
display vector encodes the pEL B Signal Peptide and gVIII gene with restriction 
enzyme sites Sst II, Xhol and BamHl for insertion of peptide fusion s to gene VIII. 

DNA 

GAATTCATGAAATACCTATTGCCTACGGCCGCAGCAGGTCTCCTCCTCTTA 
GCA GCACA ACCAGCAATGGCCGCGGAGTGACTCGAGGATCCCGCAAAAG 
CGGCCmAACTCCCTGCAAGCCTCAGCGACCGAATATATCGGTTATGCG 
TGGGCGATGGTTGTTGTCATTGTCGGCGCAACTATCGGTATCAAGCTGTTT 
AAGAAATTCACCTCGAAAGCAAGCTGATAAGAATTC 

Protein translation: 

MKYIXPTAAAGLLLLAAQPAMAAEXEDPAKAAFN^ 
MVWI VG ATI GIKLFKKFTSKAS 

*\ u = STOP CODON 
pEL B leader is in bold type 



1 1 > 



WO 00729555 



PCTAJS99/25361 



73 

The forward primer contains an additional extension that encodes the 
pelB secretion signal peptide (Power et al. Gene 775:95-99, 1992) and restriction 
sites (Sstll, Xhol, BamHl) for insertion of peptide or protein encoding fusions in- 
frame with the pVIII gene. There is an in-frame stop codon at the 5' end of the pVIII 
5 gene such that no pVTIl protein is translated unless a peptide or protein encoding 
sequence is inserted in-frame near the sequences encoding the N -terminus of pVTU. 
The resulting PGR product is digested with EcoRl and inserted into the EcoRl site f 
the phagemid vector forming a bicistronic message encoding pill followed by pVlIL 
both of which are regulated by the lac promoter upstream from the 2 consecutive 

1 0 open reading frames. (See Figure 1 8) 

From the foregoing it will be appreciated that, although specific 
embodiments of the invention have been described herein for purposes of illustration, 
various modifications may be made without deviating from the spirit and scope of the 
invention. Accordingly, the invention is not limited except as by the appended 

15 claims. 
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CLAIMS 

We claim: 



1. A method of selecting internalizing ligands displayed on a genetic 
package, comprising: 

(a) contacting one or more ligand displaying genetic packages with a 
cell(s), wherein each package carries a gene encoding a detectable product which is expressed 
upon internalization of the package, and 

(b) detecting product expressed by the cell(s); and 

thereby selecting internalizing ligands displayed on a genetic package. 

2. The method of claim 1 , further comprising isolating the cell(s) that 
express the detectable product. 

3. The method of claim 2, wherein the cell(s) are isolated by flow 

cytometry. 

4. The method of claim 2, further comprising recovering a nucleic acid 
molecule encoding a ligand from cell(s) expressing the product. 

5. The method of claim 1, wherein the ligand displaying packages 
comprise a library of ligand displaying packages. 

6. The method of claim 5, wherein the library is a cDNA library. 

7. The method of claim 5, wherein the library is an antibody gene library. 



8. The method of claim 5, wherein the library is a random peptide gene 



library. 
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9. The method of claim 5, wherein the library is a mutein library. 

10. The method claim 1, wherein the detectable product is selected from 
the group consisting of green fluorescent protein, P-galactosidase, secreted alkaline 
phosphatase, chloramphenicol acetyltransf erase, lucif erase, human growth hormone and 
neomycin phosphotransferase. 

1 1 . The method of claim 1, wherein the ligand displaying genetic packages 
comprise bacteriophage. 

12. The method of claim 10, wherein the bacteriophage are filamentous 

phage. 

13. The method of claims 11 or 12, wherein the bacteriophage carries a 

genome vector. 

14. The method of claims 11 or 12, wherein the bacteriophage carries a 

hybrid vector. 



15. The method of claim 14, wherein the hybrid vector comprises SV40 
genome nucleic acid sequences. 

16. A high throughput method of identifying an internalizing ligand 
displayed on a genetic package, comprising: 

(a) contacting one or more ligand displaying genetic packages with a 
cell(s) in an array, wherein each package carries a gene encoding at least one detectable 
product which is expressed upon internalization of the package; and 

(b) detecting product(s) expressed by the cell(s) in the array, and 
thereby identifying an internalizing ligand displayed on a genetic package. 
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17. The method of claim 16. wherein the ligand displaying package 
comprises a bacteriophage. 

18. The method of claim 16, further comprising recovering a nucleic acid 
molecule encoding a ligand and/or an anti -ligand from the cells expressing the product. 

19. The method of claim 16, wherein the ligand displaying package 
comprises a library of ligand displaying packages. 

20. The method of claim 1 9, wherein the library is a cDN A library. 
The method of claim 19, wherein the library is an antibody gene 

The method of claim 19, wherein the library is a random p e pti de gene 

23. The method of claim 1 9, wherein the library is a mutein library. 

24. The method of claim 16, wherein the detectable product is selected 
from the group consisting of green fluorescent protein, (J-galactosidase, secreted alkaline 
phosphatase, chloramphenicol acety ltransferase , luciferase, human growth hormone and 
neomycin phosphotransferase. 

25. The method of claim 17, wherein the bacteriophage are filamentous 

phage. 

26. The method of claim 17, wherein the bacteriophage are lambdoid 

phage. 



21 



library. 



22. 
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27. The method of claim 1 7, wherein the bacteriophage carries a hybrid 

vector. 

28. The method of claim 27, wherein the hybrid vector comprises SV40 or 
AAV genome nucleic acid sequences. 

29. The method of claim 17, wherein the bacteriophage carries a genome 

vector. 

30. A method of identifying an internalizing ligand displayed on a genetic 
package, comprising: 

(a) contacting one or more ligand displaying a genetic packages with a 
ceil(s), wherein each package carries a selectable marker which is detectable upon 
internalization of the package, 

(b) detecting the selectable marker internalized by the cells; and 

(c) recovering a nucleic acid molecule encoding an internalizing ligand 
from the cell(s) carrying the selectable marker, and 

thereby identifying an internalizing ligand displayed on a genetic package. 

31. The method of claim 30, further comprising isolating cells that express 
the selectable product 

32. The method of claim 30, wherein the cells are isolated by flow 

cytometry. 

33. The method of claim 30, wherein the ligand displaying packages 
comprise a library of ligand displaying packages. 

34. The method of claim 33, wherein the library is a cDNA library. 
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35. The meth d of claim 33, wherein the library is an antibody gene 

library. 



36. The method of claim 33, wherein the library is a random peptide gene 

library. 

37. The method of claim 33, wherein the library is a mutein library. 

38. The method of claim 30, wherein the selectable marker is selected from 
the group consisting of reporter gene expression, expression of a gene that confers the ability 
to permit cell growth under selection conditions, non-endogenous nucleic acid sequences that 
permit PCR amplification, and nucleic acid sequences that can be purified by protein/DNA 
binding. 

39. The method of claim 30, wherein the ligand displaying genetic 
packages comprise bacteriophage. 

40. The method of claim 39, wherein the bacteriophage are filamentous 

phage. 



41. The method of claim 39, wherein the bacteriophage carries a genome 

vector. 



42. The method of claim 39, wherein the bacteriophage comprises a hybrid 

vector. 



43 . The method of claim 42, wherein the hybrid vector comprises S V40 or 
AAV genome nucleic acid sequences. 
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44. A method of identifying a target cell or tissue for internalizing ligands. 

comprising: 

(a) contacting a library of iigand displaying genetic packages with a ceIJ(s) 
or tissue(s), wherein each package canies a gene encoding a detectable product which is 
expressed upon internalization of the package; and 

(b) detecting product expressed by the cell(s) or tissue(s), and 
thereby identifying a target cell or tissue for internalizing ligands. 

45. The method of claim 44, further comprising isolating cells that express 
the detectable product 

46. The method of claim 45, wherein the cells are isolated by flow 

cytometry. 



47. A method of selecting an internalizing ligand for a selected target cell 
or tissue within a pool of target cells or tissues and identifying a target cell or tissue for the 
internalizing ligand, comprising: 

(a) contacting a library of ligand displaying genetic packages with a pool 
of cell(s) or tissue(s), wherein each package carries a gene encoding a selectable marker 
which is expressed upon internalization of the package; 

(b) detecting the selectable marker expressed by the cell(s) or tissue(s); 

(c) recovering a nucleic acid molecule encoding an internalizing ligand 
a selected set of cell(s) or tissues) within the pool expressing the selectable marker. 



48. The method of claim 47, further comprising isolating cells that express 
the detectable product 



49. The method of claim 48, wherein the cells are isolated by flow 

cytometry. 
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50. The method of claim 47, wherein the selectable marker is selected from 
the group consisting of reporter gene expression, expression of a gene that confers the ability 
to permit cell growth under selection conditions, non-endogenous nucleic acid sequences that 
permit PCR amplification, and nucleic acid sequences that can be purified by protein/DNA 
binding. 

5 1 . The method of claims 44 or 47, wherein the target cells or tissues are 
arranged in an array to facilitate high throughput screening. 

52. The method of claim 5 1 , wherein the array contains more than one cell 
type or tissue type. 

53. The method of claim 51, wherein the Iigand displaying genetic package 
is a bacteriophage. 

54. The method of claim 53, further comprising step (c), wherein the 
library of bacteriophages is repeatedly divided into subset pools and screened using steps (a) 
and (b) until a specific bacteriophage expressing an internalizing tigand is identified; 

thereby allowing identification of the internalizing ligand. 

55. The method of any one of claims 44, 47, or 51, wherein the library 
comprises only known ligands. 

56. The method of any one of claims 44, 47, or 5 K wherein the library 
comprises at least one unknown ligand. 

57. The method of claim 44, 47, or 51, wherein the library is a cDNA 

library. 



58. The method of claim 44, 47, or 51, wherein the library is an antibody 

gene library. 
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59. The method of claim 44. 47. or 51. wherein the library is a random 
peptide gene library. 



60. The method of claim 44, 47, or 51. wherein the library is a mutein 

library. 

61. The method of claim 47, wherein the library is a pathogen coat or 
envelope gene library. 

62. The method of any one of claims 44, 47, or 5L wherein the detectable 
product is selected from the group consisting of green fluorescent protein, ^galactosidase, 
secreted alkaline phosphatase, chloramphenicol acetyltransferase, luciferase. human growth 
hormone and neomycin phosphotransferase. 

63. The method of any one of claims 44 or 47, wherein die ligand 
displaying genetic packages comprise bacteriophage. 



64. The method of claim 63, wherein the bacteriophage are filamentous 



phage. 



65. The method of claim 63, wherein the bacteriophage carries a genome 



vector. 



66. A 



MHO 



food of selecting internalizing ligand/anti-ligand pairs, 
comprising: 

(a) contacting one or more ligand displaying genetic packages with a 
cell(s), wherein each package carries a gene encoding a detectable product which is expressed 
upon internalization of the package, and wherein the cell expresses an anti-ligand-receptor 
fusion protein on its surface; and 

(b) detecting product expressed by the cell(s), and 
thereby selecting internalizing ligand/anti-ligand pairs. 
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67. The method of claim 66, further comprising isolating the cell(s) that 
express the detectable product. 

68. The method of claim 67, wherein the cell(s) are isolated by flow 

cytometry. 

69. The method of claim 67, further comprising recovering a nucleic acid 
molecule encoding a ligand and/or a nucleic acid molecule encoding an anti-iigand from 



cell(s) expressing the product 

70. The method of claim 66, wherein the ligand displaying p ackag es 
comprise a library of ligand displaying packages. 



71. The method of claim 70, wherein the cells express a library of ami- 
ligand-receptor fusion proteins. 

72. The method of claim 70 or 7 1 , wherein the library is a cDNA library. 

73. The method of claim 70 or 71, wherein the library is an antibody gene 

library. 



74. The method of claim 70 or 71, wherein the library is a random peptide 

gene library. 

75. The method of claim 70 or 71 . wherein the library is a mutein library. 

76. The method claim 66, wherein the detectable product is selected from 
the group consisting of green fluorescent protein, P-galactosidase. secreted 
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phosphatase, chloramphenicol acetyltransferase, luciferase. human growth hormone and 
neomycin phosphotransferase. 



77. The method of claim 66, wherein the ligand displaying genetic 
packages comprise bacteriophage. 



78. The method of claim 77, wherein the bacteriophage are filamentous 



phage. 



79. The method of claims 77 or 78, wherein the bacteriophage carries a 



genome vector. 



80. A method of identifying a ligand or anti-ligand of a internalizing 
ligand/anti-ligand pair, comprising: 

(a) contacting one or more ligand displaying a genetic packages with a 
cell(s), wherein each package carries a gene encoding a selectable marker which is expressed 
upon internalization of the package, and wherein the cell(s) expresses an anti-ligand-rcccptor 
fusion protein on its surface; 

(b) detecting the selectable marker expressed by the cell(s); and 

(c) recovering a nucleic acid molecule encoding an internalizing ligand 
and/or a nucleic acid molecule encoding an internalizing anti-ligand from the cell(s) 
expressing the product, and 

thereby identifying the ligand and/or anti-ligand of a internalizing ligand/anti- 

ligand pair. 



8 1 . The method of claim 80, wherein the selectable marker is selected from 
the group consisting of reporter gene expression, expression of a gene that confers the ability 
to permit cell growth under selection conditions, non-endogenous nucleic acid sequences that 
permit PCR amplification, and nucleic acid sequences that can be purified by protein/DNA 
binding. 
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82. The method of claim 80. further comprising isolating cells that express 
the detectable product 



83. The method of claim 82. wherein the cells are isolated by flow 



cytometry. 



84. The method of claim 80, wherein the ligand displaying package 
comprise a library of ligand displaying packages. 

85. The method of claim 80, wherein the cells express a library of anti- 
ligand-receptor fusions. 

86. The method of claim 84 or 85, wherein the library is a cDNA library. 



87. The method of claim 84 or 85, 



the library is an antibody gene 



library. 



88. The method of claim 84 or 85, wherein the library is a random peptide 



gene library. 



89. The method of claim 84 or 85, wherein the library is a mutein library. 



90. The method of claim 80, wherein the detectable product is selected 
from the group consisting of green fluorescent protein, P-galactosidase, secreted alkaline 
phosphatase, chloramphenicol acetyltransferase, luciferase, human growth hormone and 
neomycin phosphotransferase. 

91. The method of claim 80, wherein the ligand displaying genetic 
packages comprise bacteriophage. 
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92. The method of claim 91, wherein the bacteriophage are filamentous 



phage. 



93. The method of claim 91 or 92, wherein the bacteriophage carries a 



genome vector. 



94. The method of claims 66 or 80, wherein the target cells or 
in an array to facilitate high throughput screening. 



95. The method of claim 94, wherein the array contains cells 
library of anti-ligand-rcceptor fusion proteins. 



a 



96. The method of claim 94, wi 
comprises a bacteriophage. 



the ligand displaying package 



97. The method of claim 94, further comprising recovering a nucleic acid 
molecule encoding a ligand and/or an anti-ligand from the cells expressing the product 



98. The method of claim 94, wherein the ligand displaying package 
comprises a library of ligand displaying packages. 

99. The method of claim 95 or 98, wherein the library is a cDNA library. 



100. The method of claim 95 or 98, wherein the library is an antibody gene 



library. 



101. The method of claim 95 or 98, wherein the library is a random peptide 



gene library. 
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102. The method of claim 95 or 98. wherein the library is a mutein library. 



103. The method of claim 94 ? wherein the detectable product is selected 
from the group consisting of green fluorescent protein, p-galactosidase. secreted alkaline 
phosphatase, chloramphenicol acetyl transferase, luciferase, human growth hormone and 
neomycin phosphotransferase. 



1 04. The method of claim 96, wherein the bacteriophage are filamentous 



phage. 



105. The method of claim 96, wherein the bacteriophage are lambdoid 



phage. 



106. The method of claim 104 or 105, wherein the bacteriophage carries a 



genome vector. 



107. The method of any one of claims 1, 16, 30, 44, 47. 66, or 80, forth 
comprising contacting the target cells with a genotoxic agent 

108. The method of any one of claims 1, 16, 30, 44, 47, 66, or 80, forth 
comprising subjecting the target cells to heat shock. 

109. The method of any one of claims 1,16, 30, 44, 47, 66, or 80, further 
comprising contacting the target cells with a genotoxic agent and subjecting the target cells to 
heat shock. 

110. The method of any one of claims 1, 16, 30, 44, 47. 66, or 80, wherein 
said ligand displaying package further comprises an endosomal escape peptide or a cell 
attachment peptide. 

111. The method of claim 110, wherein said endosomal escape peptide or 
cell attachment peptide is attached to the display package non-covalently. 
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112. The method of claim 1 10, wherein said endosomaJ escape peptide oi 
cell attachment peptide is attac hed to the display package by a genetic fusion. 

113. The method of any one of claims 1, 16, 30, 44, 47 ? 66 ? or 80. wherein 
said bacteriophage displays at least two differing heterologous sequences on its surface. 

114. A medicament for gene therapy comprising an internalizing ligand 
identified by the methods of any one of claims 1,16, 30, 44, 47, 66, or 80. 

115. An anti-bacterial agent comprising an internalizing ligand identified by 
the methods of any one of claims 1,16, 30, 44, 47, 66, or 80, 

116. A method for identifying transduction facilitating peptides, 

comprising: 

(a) contacting one or more ligand displaying a genetic packages with a 
cell(s), wherein each package displays a putative transduction facilitating peptide and a ligand 
known to internalize, and wherein each package carries a selectable marker which is 
det e c t a b le upon internalization of the package, 

(b) dete ct i ng the selectable marker internalized by the cells; and 

(c) recovering a nucleic acid molecule encoding an internalizing ligand 
from the cell(s) carrying the selectable marker, and 

thereby identifying an internalizing ligand displayed on a genetic package. 
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